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PREFACE 


This is an expanded version of a course given for a number of years at the 
California Institute of Technology. It was designed for applied mathema- 
tics students in the first and second years of graduate study; it appears to 
have been equally useful for students in engineering and physics. 

The presentation is intended to be self-contained but both the order 
chosen for the topics and the level adopted suppose previous experience 
with the elementary aspects of linear wave propagation. The aim is to 
cover all the major well-established ideas but, at the same time, to 
emphasize nonlinear theory from the outset and to introduce the very 
active research areas in this field. The material covered is outlined in detail 
in Chapter 1. The mathematical development of the subject is combined 
with considerable discussion of applications. For the most part previous 
detailed knowledge of a field of application is not assumed; the relevant 
physical ideas and derivation of basic equations are given in depth. The 
specific mathematical background required is familiarity with transform 
techniques, methods for the asymptotic expansion of integrals, solutions of 
standard boundary value problems and the related topics that are usually 
referred to collectively as “mathematical methods.” 

Parts of the account are drawn from research supported over the last 


_ several years by the Office of Naval Research. It is a pleasure to express 


my gratitude to the people there, particularly to Leila Bram and Stuart 
Brodsky. 

My special thanks to Vivian Davies and Deborah Massey who typed the 
manuscript and cheerfully put up with my constant rewrites and changes. 
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LINEAR AND 
NONLINEAR WAVES 


CHAPTER 1 


Introduction and General Outline 


Wave motion is one of the broadest scientific subjects and unusual in 
that it can be studied at any technical level. The behavior of water waves 
and the propagation characteristics of light and sound are familiar from 
everyday experience. Modern problems such as sonic booms or moving 
bottlenecks in traffic are necessarily of general interest. All these can be 
appreciated in a descriptive way without any technical knowledge. On the 
other hand they are also intensively studied by specialists, and almost any 
field of science or engineering involves some questions of wave motion. 

There has been a correspondingly rich development of mathematical 
concepts and techniques to understand the phenomena from the theoreti- 
cal standpoint and to solve the problems that arise. The details in any 
particular application may be different and some topics will have their own 
unique twists, but a fairly general overall view has been developed. This 
book is an account of the underlying mathematical theory with emphasis 
on the unifying ideas and the main points that illuminate the behavior of 
waves. Most of the typical techniques for solving problems are presented, 
but these are not pursued beyond the point where they cease to give 
information about the nature of waves and become exercises in 
*‘mathematical methods,” difficult and intriguing as these may be. This 
applies particularly to linear wave problems. Important and fundamental 
properties of linear theory which are basic to the understanding of waves 
- must be covered. But one could then fill volumes with solutions and 
techniques for specific problems. This is not the purpose of the book. 
Although the basic material on linear waves is included, some previous 
experience with linear theory is assumed and the emphasis is on the 
conceptually more difficult nonlinear theory. The study of nonlinear waves 
started over a hundred years ago with the pioneering work of Stokes (1847) 
and Riemann (1858), and it has proceeded at an accelerating pace, with 
considerable development in recent years. The purpose here is to give a 
unified treatment of this body of material. 

The mathematical ideas are liberally interspersed with discussion of 
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specific cases and specific physical fields. Particularly in nonlinear prob- 
lems this is essential for stimulation and illumination of the correct 
mathematical arguments, and, in any case, it makes the subject more 
interesting. Many of these topics are related to some branch of fluid 
mechanics, or to examples such as traffic flow which are treated m 
analogous fashion. This is unavoidable, since the main ideas of nonlinear 
waves were developed in these subjects, although it doubtless also reflects 
personal interest and experience. But the account is not written specifically 
for fluid dynamicists. The ideas are presented in general, and topics for 
application or motivation are chosen with a general reader in mind. It is 
assumed that flood waves in rivers, waves in glaciers, traffic flow, sonic 
booms, blast waves, ocean waves from storms, and so on, are of universal 
interest. Other fields are not excluded, and detailed discussion is given, for 
example, of nonlinear optics and waves in various mechanical systems. On 
the whole, though, it seemed better in applications to concentrate in a 
nontrivial way on representative areas, rather than to present superficial 
applications to sets of equations merely quoted from every conceivable 
field. 

The book is divided into two parts, the first on hyperbolic waves and 
the second on dispersive waves. The distinction will be explained in the 
next section. In Part I the basic ideas are presented in Chapters 2, 5, 7, 
while in Part II they appear in Chapters 11, 14, 15, 17. The intervening 
chapters amplify the general ideas in specific contexts and may be read in 
full or sampled according to the reader’s interests. It should also be 
possible to proceed directly to Part II from Chapter 2. 


1.1 The Two Main Classes of Wave Motion 


There appears to be no single precise definition of what exactly 
constitutes a wave. Various restrictive definitions can be given, but to 
cover the whole range of wave phenomena it seems preferable to be guided 
by the intuitive view that a wave is any recognizable signal that is 
transferred from one part of the medium to another with a recognizable 
velocity of propagation. The signal may be any feature of the disturbance, 
such as a maximum or an abrupt change in some quantity, provided that it 
can be clearly recognized and its location at any time can be determined. 
The signal may distort, change its magnitude, and change its velocity 
provided it is still recognizable. This may seem a little vague, but it turns 
out to be perfectly adequate and any attempt to be more precise appears to 
be too restrictive; different features are important in different types of 
wave. 
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Nevertheless, one can distinguish two main classes. The first is formu- 
lated mathematically in terms of hyperbolic partial differential equations, 
and such waves will be referred to as hyperbolic. The second class cannot 
be characterized as easily, but since it starts from the simplest cases of 
dispersive waves in linear problems, we shall refer to the whole class as 
dispersive and slowly build up a more complete picture. The classes are not 
exclusive. There is some overlap in that certain wave motions exhibit both 
types of behavior, and there are certain exceptions that fit neither. 

The prototype for hyperbolic waves is often taken to be the wave 
equation 


although the equation 
φ, + Cop, =0 (1.2) 


is, in fact, the simplest of all. As will be seen, there is a precise definition 
for hyperbolic equations which depends only on the form of the equations 
and is independent of whether explicit solutions can be obtained or not. 
On the other hand, the prototype for dispersive waves is based on a type of 
solution rather than a type of equation. A linear dispersive system is any 
system which admits solutions of the form 


p= acos (xx -- οἱ), (1.3) 


where the frequency ὦ is a definite real function of the wave number κ and 
the function w(x) is determined by the particular system. The phase speed 
is then w(x)/« and the waves are usually said to be “dispersive” if this 
phase speed is not a constant but depends on κ. The term refers to the fact 
that a more general solution will consist of the superposition of several 
modes like (1.3) with different «. [In the most general case a Fourier 
integral is developed from (1.3).] If the phase speed w/« is not the same for 
all x, that is, oc, where cy is some constant, the modes with different « 
will propagate at different speeds; they will disperse. It is convenient to 
modify the definition slightly and say that (1.3) is dispersive if w’(k) is not 
constant, that is, w”(«)#0. 

It should be noted that (1.3) is also a solution of the hyperbolic 
equation (1.2) with w=c x, or of (1.1) with w= + cok. But these cases are 
excluded from the dispersive classification by the condition w” #0. How- 
ever, it is not hard to find cases of genuine overlap in which the equations 
are hyperbolic and yet have solutions (1.3) with nontrivial dispersion 
relations w=w(k). One such example is the Klein-Gordon equation 


Pi — Pxx + P=. (1.4) 
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It is hyperbolic and yet (1.3) is a solution with w*=«?+1. This dual 
behavior is limited to relatively few instances and should not be allowed to 
obscure the overall differences between the two main classes. It does 
perhaps contribute to a fairly common misunderstanding, particularly 
encouraged in mathematical books, that wave motion is synonymous with 
hyperbolic equations and (1.3) is a less sophisticated approach to the same 
thing. The true emphasis should probably be the other way round. Rich 
and various as the class of hyperbolic waves may be, it is probably fair to 
say that the majority of wave motions fall into the dispersive class. The 
most familiar of all, ocean waves, is a dispersive case governed by 
Laplace’s equation with strange boundary conditions at the free surface! 

The first part of this book is devoted to hyperbolic waves and the 
second to dispersive waves. The theory of hyperbolic waves enters again 
into the study of dispersive waves in various curious ways, so the second 
part is not entirely independent of the first. The remainder of this chapter 
is an outline of the various themes, most of which are taken up in detail in 
the remainder of the book. The purpose is to introduce the material, but at 
the same time to give an overall view which is extracted from the detailed 
account. 


1.2 Hyperbolic Waves 


The wave equation (1.1) arises in acoustics, elasticity, and 
electromagnetism, and its basic properties and solutions were first devel- 
oped in these areas of classical physics. In all cases, however, this is not 
the whole story. 

In acoustics, one starts with the equations for a compressible fluid. 
Even if viscosity and heat conduction are neglected, this is a set of 
nonlinear equations in the velocity vector u, the density p, and pressure p. 
Acoustics refers to the approximate linear theory in which all the distur- 
bances are assumed to be small perturbations to an ambient constant state 
in which u=0, p= po, p = Po. The equations are linearized by retaining only 
first order terms in the small quantities u, p—p 9, p — Po, that is, all powers 
higher than the first and all products of small quantities are omitted. It can 
then be shown that each component of u and the perturbations p— po, 
P—Ppo satisfy the wave equation (1.1). Once this has been solved for the 
appropriate boundary conditions or initial conditions that provide the 
source of the sound, it is natural to ask various questions about how this 
solution relates to the original nonlinear equations. Even for such weak 
perturbations, are the linear results accurate and are any important qual- 
itative features lost in the approximation? If the disturbances are not 
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weak, as in explosions or in the disturbances caused by high speed 
supersonic aircraft and missiles, what progress can be made directly on the 
original nonlinear equations? What are the modifying effects of viscosity 
and heat conduction? The answers to these questions in gas dynamics led 
to most of the fundamental ideas in nonlinear hyperbolic waves. The most 
outstanding new phenomenon of the nonlinear theory is the appearance of 
shock waves, which are abrupt jumps in pressure, density, and velocity: the 
blast waves of explosions and the sonic booms of high speed aircraft. But 
the whole intricate machinery of nonlinear hyperbolic equations had to be 
developed for their prediction, and a full understanding required analysis 
of the viscous effects and some aspects of kinetic theory. 

In this way a set of basic ideas became clear within the context of gas 
dynamics, although one should add that the investigation of more compli- 
cated cases and the search for deeper understanding of the kinetic theory 
aspects, for example, are still active fields. The basic mathematical theory, 
developed in gas dynamics, is appropriate for any system governed by 
nonlinear hyperbolic equations, and it has been used and refined in many 
other fields. 

In elasticity, the classical wave theory is also obtained after lineariza- 
tion. Even with the linear theory, the situation is more complicated 
because the system of equations leads to essentially two wave equations of 
the form (1.1) with two functions q,, g, and two wave speeds, c,, c,, which 
are associated with the different modes of propagation for compression 
waves and shear waves. The two functions g, and g, are coupled through 
the appropriate boundary conditions, and generally the problem is much 
more complicated than merely solving the wave equation (1.1). At a free 
surface of an elastic body, there is further complication in that surface 
waves, so-called Rayleigh waves, are possible; these are more akin to 
dispersive waves and travel at an intermediate speed between c, and ὁ). 
Because of these extra complications, the nonlinear theory has not been 
developed as fully as in gas dynamics. 

In electromagnetism there is also the complication that while different 
components of the electric and magnetic fields satisfy (1.1), they are 
coupled by additional equations and by the boundary conditions. 
Although the classical Maxwell equations are posed in linear form from 
the outset, there is much present interest in “nonlinear optics,” since 
devices such as lasers produce intense waves and various media react 
nonlinearly. | 

The corresponding mathematical theme started from the study of 
solutions of (1.1). The one dimensional equation for plane waves, 


Pit ~ C6 Pxx = 9; (1.5) 
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is particularly simple. It can be rewritten in terms of new variables 


Q=x-—Cof, B=xt+ Col, (1.6) 
as 
Pap = 9. . (1.7) 


This is immediately integrated to show that the general solution is 
p=f(a)+8(B) 
= f(x—Cot) +9(x+ cot), (1.8) 


where f and g are arbitrary functions. 

The solution is a combination of two waves, one with shape described 
by the function f moving to the right with speed co, and the other with 
shape g moving to the left with speed co. It would be even simpler if there 
were only one wave. The required equation corresponds to factoring (1.5) 
as 


and retaining only one of the factors. If we retain only 

Q, + Cop, = 9, (1.10) 
the general solution is 

p=f(x—Col). (1.11) 


This is the simplest hyperbolic wave problem. Although the classical 
problems led to (1.5), many wave motions have now been studied which do 
in fact lead to (1.10). Examples are flood waves, waves in glaciers, waves in 
traffic flow, and certain wave phenomena in chemical reactions. We shall 
start with these in Chapters 2 and 3. Just as in the classical problems, the 
original formulations lead to nonlinear equations and the simplest is 


y, + c(p) 9, =9, (1.12) 


where the propagation speed c(q) is a function of the local disturbance 9. 
The study of this deceptively simple-looking equation will provide all the 
main concepts for nonlinear hyperbolic waves. We follow the ideas which 
were developed first in gas dynamics, but now we develop them in the 
simpler mathematical context. The main nonlinear feature is the breaking 
of waves into shock waves, and the corresponding mathematical theory is 
the theory of characteristics and the special treatment of shock waves. This 
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is all presented in detail in Chapter 2. The theory is then applied and 
supplemented in Chapter 3 in a full discussion of the topics of flood waves 
and similar waves noted earlier. 

The first order equation (1.12) is called quasi-linear in that it is 
nonlinear in q but is linear in the derivatives φ,, p,. The general nonlinear 
first order equation for φίχ, 1) is any functional relation between q, 9,, 9,. 
This more general case as well as the extension to first order equations in ἢ 
independent variables is included in Chapter 2. 

In the framework of (1.12), shock waves appear as discontinuities in φ. 
However, the derivation of (1.12) usually involves approximations which 
are not strictly valid when shock waves arise. In gas dynamics the corresp- 
onding approximation is the omission of viscous and heat conduction 
effects. Again, the same mathematical effects can be seen in examples 
simpler than gas dynamics, even though the appropriate ideas were first 
explored there. These effects are included in Chapters 2 and 3. The 
simplest case is the equation 


D, + PY, = VY Px: (1.13) 


It was particularly stressed by Burgers (1948) as being the simplest one to 
combine typical nonlinearity with typical heat diffusion, and it is usually 
referred to as Burgers’ equation. It was probably introduced first by 
Bateman (1915). It acquired even more interest when it was shown by 
Hopf (1950) and Cole (1951) that the general solution could be obtained 
explicitly. Various questions can be investigated in great detail on this 
typical example, and then used with confidence in other cases where the 
full solution is not available and one must resort to special or approximate 
methods. Chapter 4 is devoted to Burgers’ equation and its solution. 

For two independent variables, usually the time and one space dimen- 
sion, the general system corresponding to (1.12) 15 


apenas 
vay + %9, + = 0, i=1,....7, (1.14) 


A 
for n unknowns u,(x,t). (The usual convention is used that summation 
j=1,...,n is to be understood for the repeated subscript 7.) For linear 
systems, the matrices A,, a, are independent of u, and the vector 5, is a 
linear expression 


ΠΝ 


b.=b..u, (1.15) 


i yj 


in u; (1.5) can be written in this form. When A,,, a,, δι are functions of u 


but not of its derivatives, the system is quasi-linear. Chapter 5 starts with a 
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discussion of the conditions necessary for (1.14) to be hyperbolic (and 
hence to correspond to hyperbolic waves), and it then turns to the general 
theory of characteristics and shocks for such hyperbolic systems. 

Gas dynamics is the subject that provided the basis for this material 
and is its most fruitful physical context. Chapter 6 is a fairly detailed 
account of gas dynamics for both unsteady problems and supersonic flow. 
Problems of cylindrical and spherical explosions are included, since they 
also reduce to two independent variables. 

For genuine two or three dimensional problems, we turn in Chapter 7 
to a more comprehensive discussion of solutions of the wave equation 
(1.1). It is perhaps a novelty in a book on wave propagation to delay this 
so long, and to give such an extensive discussion of nonlinear effects first. 
This is due to an ordering based on the number of dimensions rather than 
the difficulty of the concepts or the availability of mathematical tech- 
niques. Chapter 7 includes the aspects of solutions to (1.1) which reveal 
information about the nature of the wave motion involved and which offer 
the possibility of generalization to other wave systems. The prime example 
of this is the theory of geometrical optics, which extends to linear waves in 
nonhomogeneous media and is the basis for similar developments related 
to shock propagation in nonlinear problems. No attempt is made to give 
even a relatively brief account of the huge areas of diffraction and 
scattering theory, nor of the special features of elastic or electromagnetic 
waves. These are all too extensive to be adequately treated in a book that 
has such a broad range of topics already. 

Chapters 8 and 9 devoted to shock dynamics and propagation prob- 
lems related to sonic booms build on all this material and show how it can 
be brought to bear on difficult nonlinear problems. In these two chapters, 
intuitive ideas and approximations based on physical arguments are used 
to surmount the mathematical difficulties. Although these problems are 
drawn from fluid mechanics, it is hoped that the results and the style of 
thinking will be useful in other fields. 

The final chapter on hyperbolic waves concerns those situations where 
waves of different orders are present simultaneously. A typical example is 
the equation 


η(φ, — C6Pex) +O + ApH, = 0. (1.16) 


This is hyperbolic with characteristic velocities +c, determined from the 
second order wave operator. Yet if 7 is small, the lower order wave 
operator g,+ app, =0 should be a good approximation in some sense, and 
this predicts waves with speed ap. It turns out that both kinds of wave play 
important roles, and there are important interaction effects between the 
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two. The higher order waves carry the “first signal” with speed cy, but the 
“main disturbance” travels with the lower order waves at speed ay. In the 
nonlinear counterparts to (1.16) this has important bearing on properties of 
shocks and their structure. This general topic is taken up in Chapter 10. 


1.3 Dispersive Waves 


Dispersive waves are not classified as easily as hyperbolic waves. As 
explained in connection with (1.3), the discussion stems from certain types 
of oscillatory solution representing a train of waves. Such solutions are 
obtained from a variety of partial differential equations and even certain 
integral equations. One rapidly realizes that it is the dispersion relation, 
written 


w= W(x), (1.17) 


connecting the frequency w and the wave number x, which characterizes 
the problem. The source of this relation in the particular system of 
equations governing the problem is of subsidiary importance. Some of the 
typical examples are the beam equation 


φ, t+ Y Perey =O, w= + yK?, (1.18) 


the linear Korteweg-deVries equation 
φ, + CoP, + ¥9,,,. = 9, W = Cok — PK, (1.19) 


and the linear Boussinesq equation 


—1/2 
Pit — αφ,, = Bg o= + ak (1+ B77) : . (1.20) 


Equations 1.19 and 1.20 appear in the approximate theories of long water 
waves. The general equations for linear water waves require more detail to 
explain, but the upshot is a solution (1.3) for the displacement of the 
surface with 


w= +(gxtanhkh)'/”, (1.21) 


where h is the undisturbed depth and g is the gravitational acceleration. 
Another example is the classical theory for the dispersive effects of 
electromagnetic waves in dielectrics; this leads to 


(w? — v6) (w? — ck?) = οὖν, | (1.22) 
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where c, is the speed of light, νρ is the natural frequency of the oscillator, 
and ν, is the plasma frequency. 

For linear problems, solutions more general than (1.3) are obtained by 
superposition to form Fourier integrals, such as 


p= [- F(x) cos (xx Wt) dk, (1.23) 
0 


where W(«) is the dispersion function (1.17) appropriate to the system. 
Formally, at least, this is a solution for arbitrary F(«), which is then chosen 
to fit the boundary or initial conditions, with use of the Fourier inversion 
theorem. 

The solution in (1.23) is a superposition of wavetrains of different 
wave numbers, each traveling with its own phase speed 


c(k) = 2.1 (1.24) 


K 


As time evolves, these different component modes “disperse,” with the 
result that a single concentrated hump, for example, disperses into a whole 
oscillatory train. This process is studied by various asymptotic expansions 
of (1.23). The key concept that comes out of the analysis is that of the 
group velocity defined as 
dw 

C(x)= he (1.25) 
The oscillatory train arising from (1.23) does not have constant wave- 
length; the whole range of wave numbers x is still present. In a sense to be 
explained, the different values of wave number propagate through this 
oscillatory train and the speed of propagation is the group velocity (1.25). 
In a similar sense it is found that energy also propagates with the group 
velocity. For genuinely dispersive waves, the case Wa x is excluded so that 
the phase velocity (1.24) and the group velocity (1.25) are not the same. 
And it is the group velocity which plays the dominant role in the propaga- 
tion. 

In view of its great importance, and with an eye to nonuniform media 
and nonlinear waves, it is desirable to find direct ways of deriving the 
group velocity and its properties without the intermediary of the Fourier 
analysis. This can be done very simply on an intuitive basis, which can be 
justified later. Assume that the nonuniform oscillatory wave is described 
approximately in the form 


y=acos8, (1.26) 
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where a and @ are functions of x and: t. The function @(x, ἢ) is the “phase” 
which measures the point in the cycle of cos@ between its extreme values 
of +1, and a(x, ?) is the amplitude. The special uniform wavetrain has 


a=constant, @=Kx~—wt, w=W(k). (1.27) 


In the more general case, we define a Jocal wave number k(x, 1?) and a local 
frequency w(x,7) by 


k(x,t)= <2, Boe (1.28) 


Assume now that these are still related by the dispersion relation 


w= W(k). (1.29) 
This is then an equation for 6: 
00 00 
oF Ψ[ Ὁ 22 )- 0, (1.30) 


and its solution determines the kinematic properties of the wavetrain. It is 
more convenient to eliminate θ᾽ from (1.28) to obtain 


a t ae =O: (1.31) 


and to work with the pair of relations (1.29) and (1.31). Replacing w by 
W(k) in (1.31), we have 


0k 


τ +C(k) (1.32) 


where C(k) is the group velocity defined in (1.25). This equation for k is 
just the simplest nonlinear hyperbolic equation given in (1.12)! It may be 
interpreted as a wave equation for the propagation of k with speed C(k). 
In this rather subtle way, hyperbolic phenomena are hidden in dispersive 
waves. This may be exploited to bring the methods of Part I to bear on 
dispersive wave problems. 

The more intuitive analysis of group velocity indicated here is readily 
extended to more dimensions and to nonuniform media where the exact 
solutions are either inconvenient or unobtainable. The results then usually 
may be justified directly as the first term in an asymptotic solution. These 
basic questions with emphasis on the understanding of group velocity 
arguments are studied in Chapter 11. 

Once the group velocity arguments are established, they provide a 
surprisingly simple yet powerful method for deducing the main features of 
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any linear dispersive system. A wide variety of such cases is given in 
Chapter 12. 

It is easy to show asymptotically from the Fourier integral (1.23) that 
energy ultimately propagates with the group velocity. For purposes of 
generalization, it is again important to have direct approaches to this basic 
result. Some of these are explained in Chapter 11, but until recently there 
was no wholly satisfactory approach. In the last few years, the problem has 
been resolved as an offshoot of the investigation of the corresponding 
questions for nonlinear waves. The nonlinear problems required a more 
powerful approach altogether, and eventually the possibility of using 
variational principles was realized. These appear to provide the correct 
tools for all these questions in both linear and nonlinear dispersive waves. 
Judging from its recent success, this variational approach has led to a 
completely fresh view of the subject. It is taken up for linear waves in 
preliminary fashion in Chapter 11 and the full nonlinear version is de- 
scribed in Chapter 14. 

The intermediate Chapter 13 is on the subject of water waves. This 15 
perhaps the most varied and fascinating of all the subjects in wave motion. 
It includes a wide range of natural phenomena in the oceans and rivers, 
and suitably interpreted it applies to gravity waves in the atmosphere and 
other fluids. It has provided the impetus and background for the devel- 
opment of dispersive wave theory, with much the same role that gas 
dynamics has played for hyperbolic waves. In particular, the fundamental 
ideas for nonlinear dispersive waves originated in the study of water waves. 


1.4 Nonlinear Dispersion 


In 1847 Stokes showed that the surface elevation ἡ in a plane wave- 
train on deep water could be expanded in powers of the amplitude a as 


n=acos (Kx —wt) + 4xa*cos2(Kx — wt) 
+ 37a? cos3(Kx—wt)+---, (1.33) 
where 
w= gk(1+x7a7+---). (1.34) 


The linear result would be the first term in (1.33) in agreement with (1.3) 
and the dispersion relation would be 


w? = gk, (1.35) 
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in agreement with (1.21) since one takes the limiting form kh—>00 for deep 
water. There are two key ideas here. First, there exist periodic wavetrain 
solutions in which the dependent variables are functions of a phase 
6 = kx — wt, but the functions are no longer sinusoidal; (1.33) is the Fourier 
series expansion of the appropriate function η(θ). The second crucial idea 
is that the dispersion relation (1.34) also involves the amplitude. This 
introduces a qualitatively new feature and the nonlinear effects are not 
merely slight corrections. 

In 1895 Korteweg and deVries showed that long waves, in water of 
relatively shallow depth, could be described approximately by a nonlinear 
equation of the form 


η, + (cot c\n)n, + 77,,, =9, (1.36) 


where cy, ὦ); and ν are constants. A linearization of this for very small 
amplitudes would drop the term c,mn,; the resulting linear equation has 
solutions 


n=acos(kx—of), 
; (1.37) 


W = Cok — VK. 


One could improve on this by Stokes-type expansions in the amplitude. 
But one can do better: Korteweg and deVries showed that periodic 
solutions 


n=f(0), O=Kx-wt 


of (1.36) could be found in closed form, and without further approxima- 
tion, in terms of Jacobian elliptic functions. Since f(@) was found in terms 
of the elliptic function cn@, they named the solutions cnoidal waves. This 
work endorses the general conclusions of Stokes’ work. First, the existence 
of periodic wavetrains is demonstrated explicitly. Second, f(@) contains an 
arbitrary amplitude a, and the solution includes a specified dispersion 
relation between w, x, and a, the most important nonlinear effect being 
again the inclusion of the amplitude in this relation. 

But even more was found. One limit of cn@ (as the modulus tends to 
1) is the sech function. Either by taking this limit or directly from (1.36), 
the special solution 


1/2 


n= aseet? { (5) (xn) |, (1.38) 


U=cyt ea (1.39) 
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may be established. In this limit the period has become infinite and (1.38) 
represents a single hump of positive elevation. It is the “solitary wave,” 
discovered experimentally by Scott Russell (1844), and previously analyzed 
on an approximate basis by Boussinesq (1871) and Rayleigh (1876). The 
inclusion of the solitary wave with the periodic wavetrains in the same 
analysis was an important step. Equation 1.39 for the velocity of propaga- 
tion U in terms of the amplitude is the remnant of the dispersion relation 
in this nonperiodic case. 


Although the equation originated in water waves, it was subsequently 
realized that the Korteweg-deVries equation is one of the simplest proto- 
types that combines nonlinearity and dispersion. In this respect it is 
analogous to Burgers’ equation, which combines nonlinearity with diffu- 
sion. It has now been. derived as a useful equation in other fields. 

In recent years other simple equations have been derived in various 
fields and also used as prototypes to develop and test ideas. Notable 
among these are the equation 


φ, Py ἘΚ (φ) =9, (1.40) 
a natural generalization of the linear Klein-Gordon equation, and 
ip, ty, + |p/>p =0, (1.41) 


a generalization of Schrédinger’s equation. We return to comment on these 
later. 

First we must consider the question of how to build further on Stokes’ 
general result, confirmed by many other examples, that the existence of 
periodic wavetrains is a typical feature of nonlinear dispersive systems. 
These solutions are the counterparts of (1.3) but one cannot proceed by 
simple Fourier superposition. However, the eventual description of many 
important results in linear theory is in terms of the group velocity for 
modulated wavetrains as described following (1.26). These ideas are not 
crucially dependent on the Fourier synthesis and a theory of nonlinear 
group velocities can be developed. The appropriate analysis can be put in a 
general, concise form using the variational techniques already referred to. 
The theory is given in Chapter 14. The dependence of the dispersion 
relation on the amplitude introduces a number of new phenomena (for 
example, there are two group velocities) and these are discussed in general 
terms in Chapter 15. In addition to the original problems of water waves, 
one of the main fields of application is the new, rapidly expanding field of 
nonlinear optics. A selection of applications to both fields is given in 
Chapter 16. 
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One of the most interesting topics in nonlinear optics is the self- 
focusing of beams, and (1.41) arises in this context. Equation 1.40, particu- 
larly in the so-called Sine-Gordon case of 


Pi — φ, + sing =0, | (1.42) 


arises in a number of areas. Both of these equations share with the 
Korteweg-deVries equation in having solitary wave solutions as limiting 
cases. Solitary waves were always of obvious interest, since they are strictly 
nonlinear phenomena with no counterparts in linear dispersive theory. But 
until recently little further was known. Now, stemming from the remark- 
able work of Gardner, Greene, Kruskal, and Miura(1967)on the Korteweg- 
deVries equation and Perring and Skyrme (1962) and Lamb (1967, 1971) 
on the Sine-Gordon equation, families of exact solutions representing 
interacting solitary waves have been found. The surprising result is that 
solitary waves retain their individuality under interaction and eventually 
emerge with their original shapes and speeds. These solutions are only one 
class obtained in a more general attack on the equations, with further 
results on the solutions for arbitrary initial conditions being fairly com- 
plete. Zakharov and Shabat (1972) extended the methods of Gardner et al. 
to the cubic Schrédinger equation (1.41) and found similar results. An 
account of these important and ingenious investigations is given in Chapter 
17. 


Part I 


HYPERBOLIC WAVES 


CHAPTER 2 


Waves and First Order Equations 


We start the detailed discussion of hyperbolic waves with a study of 
first order equations. As noted in Chapter I, the simplest wave equation is 


P,+¢op,=9, c =constant. (2.1) 


When this equation arises, the dependent variable is usually the density of 
something so we now use the symbol p rather than the all-purpose symbol 
φ of the introduction. The general solution of (2.1) is p= f(x — cof), where 
f(x) is an arbitrary function, and the solution of any particular problem 
consists merely of matching the function f to initial or boundary values. It 
clearly describes a wave motion since an initial profile f(x) would be 
translated unchanged in shape a distance cof to the right at time ¢. At two 
observation points a distance s apart, exactly the same disturbance would 
be recorded with a time delay of s/cp. 
Although this linear case is almost trivial, the nonlinear counterpart 


p, + c(p)p, =9, (2.2) 


where c(p) is a given function of p, is certainly not and a study of it leads 
to most of the essential ideas for nonlinear hyperbolic waves. As remarked 
earlier, many of the classical examples of wave propagation are described 
by second or higher order equations such as the wave equation οὖν ῳ = 9,,, 
but a surprising number of physical problems do lead directly to (2.2) or 
extensions of it. Examples will be given after a preliminary discussion of 
the solution. Even in higher order problems, one often searches for special 
solutions or approximations that involve (2.2). 


2.1 Continuous Solutions 


One approach to the solution of (2.2) is to consider the function p(x, t) 
at each point of the (x,¢) plane and to note that p,+c(p)p, is the total 
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derivative of p along a curve which has slope 


ax _ 
at every point of it. For along any curve in the (χ, ἢ) plane, we may 
consider x and p to be functions of t, and the total derivative of p is 
d 0 0 
ρ ὃρ dx 9p 


dt δι at ox’ 


The total derivative notation should be sufficient to indicate when x and p 
are being treated as functions of ¢ on a certain curve; the introduction of 
new symbols each time this is done eventually becomes confusing. We now 
consider a curve Ὁ in the (χ,1) plane which satisfies (2.3). Of course such a 
curve cannot be determined explicitly in advance since the defining equa- 
tion (2.3) involves the unknown values of p on the curve. However, its 
consideration will lead us to a simultaneous determination of a possible 
curve @ and the solution p on it. On © we deduce from the total 
derivative relation and from (2.2) that 


do dx 
ae BN egy: 2.4 


We first observe that p remains constant on ὦ. It then follows that c(o) 
remains constant on @, and therefore that the curve C must be a straight 
line in the (x,t) plane with slope c(p). Thus the general solution of (2.2) 
depends on the construction of a family of straight lines in the (x, ἢ) plane, 
each line with slope c(p) corresponding to the value of p on it. This is easily 
done in any specific problem. 

Let us take for example the initial value problem 


p=f(x), t=0, -—wK<x<o, 


and refer to the (x,¢) diagram in Fig. 2.1. If one of the curves © intersects 
t=0 at x=& then p=/f(&) on the whole of that curve. The corresponding 
slope of the curve is c(f()), which we will denote by F(§); it is a known 
function of € calculated from the function c(p) in the equation and the 
given initial function f(). The equation of the curve then 15 


x=&+ F(é)t. 


This determines one typical curve and the value of p on it is f(€). Allowing 
€ to vary, we obtain the whole family: 


p=f(é), c= F(E)=c(f(€)) (2.5) 
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Fig. 2.1. Characteristic diagram for nonlinear waves. 


on 
x=f+ F(E)t. (2.6) 


We may now change the emphasis and use (2.5) and (2.6) as an 
analytic expression for the solution, free of the particular construction. 
That is, p is given by (2.5) where &(x, ἢ) is defined implicitly by (2.6). Let us 
check that this gives the solution. From (2.5), 


ρ,.Ξ Γ (δ) ρ.Ξ Γ (ξ)ξ,. 
and from the 1 and x derivatives of (2.6), 

O= F(E)+ {1+ F’(E) thE, 

1={1+F’(é)t}&. 


Therefore 


FOS) ΒΕ 


a 1+ F'(€)t? Oe T+ (Et (2.1) 


and we see that 


p,+c(p)p, =0 
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since c(p)= F(é). The initial condition p= f(x) is satisfied because =x 
when ¢=0. 

The curves used in the construction of the solution are the 
characteristic curves for this special problem. Similar characteristics play an 
important role in all problems involving hyperbolic differential equations. 
In general, characteristic curves do not have the property that the solution 
remains constant along them. This happens to be true in the special case of 
(2.2); it is not the defining property of characteristics. The general defini- 
tions will be considered later, but it will be convenient now to refer to the 
curves defined by (2.3) as characteristics. 

The basic idea of wave propagation is that some recognizable feature 
of the disturbance moves with a finite velocity. For hyperbolic equations, 
the characteristics correspond to this idea. Each characteristic curve in 
(x,t) space represents a moving wavelet in x space, and the behavior of the 
solution on a characteristic curve corresponds to the idea that information 
is carried by that wavelet. The mathematical statement in (2.4) may be 
given this type of emphasis by saying that different values of p “propagate” 
with velocity c(p). Indeed, the solution at time ¢ can be constructed by 
moving each point on the initial curve p= f(x) a distance c(p)t to the right; 
the distance moved is different for the different values of p. This is shown 
in Fig. 2.2 for the case c’(p) > 0; the corresponding time levels are indicated 
in Fig. 2.1. The dependence of c on p produces the typical nonlinear 
distortion of the wave as it propagates. When c’(p)>0, higher values of p 
propagate faster than lower ones. When c’(p)<0, higher values of p 
propagate slower and the distortion has the opposite tendency to that 
shown in Fig. 2.2. For the linear case, c is constant and the profile is 
translated through a distance ct without any change of shape. 

It is immediately apparent from Fig. 2.2 that the discussion is far from 
complete. Any compressive part of the wave, where the propagation 
velocity is a decreasing function of x, ultimately “breaks” to give a 


x 


Fig. 2.2. Breaking wave: successive profiles corresponding to the times 0, ¢), tg, 135 in Fig. 2.1. 
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triple-valued solution for p(x,t). The breaking starts at the time indicated 
by t=¢, in Fig. 2.2, when the profile of p first develops an infinite slope. 
The analytic solution (2.7) confirms this and allows us to determine the 
breaking time t,. On any characteristic for which F’(é)<0, p, and 9, 
become infinite when 
ΝΕ. 
F’(é) 

Therefore breaking first occurs on the characteristic €=&€, for which 
F’(€) <0 and |F’(§)| is a maximum; the time of first breaking is 


== ΞΕ ἘΞ (2.8) 


F’(é) 


This development can also be followed in the (χ, ἢ) plane. A compressive 
part of the wave with F’(é)<0 has converging characteristics; since the 
characteristics are straight lines, they must eventually overlap to give a 
region where the solution is multivalued, as in Fig. 2.1. This region may be 
considered as a fold in the (x,t) plane made up of three sheets, with 
different values of p on each sheet. The boundary of the region is an 
envelope of characteristics. The family of characteristics is given by (2.6) 
with € as parameter. The condition that two neighboring characteristics &, 
ξ- δὲ intersect at a point (x, /) is that | 


x=f+F(é)t 
and 


x=E+ 66+ F(E+ 6€)t 
hold simultaneously. In the limit 6-0, these give 
x=£+ F(£)t and O=1+F'(E)t 


for the implicit equations of an envelope. The second of these relations 
shows that an envelope is formed in t >0 by those characteristics for which 
F’(§) <0. The minimum value of t on the envelope occurs for the value of € 
for which — F’(g) is maximum. This is the first time of breaking in 
agreement with (2.8). If F”(€) is continuous, the envelope has a cusp at 
t=t,, €=&,, as shown in Fig. 2.1. 

An extreme case of breaking arises when the initial distribution has a 
discontinuous step with the value of c(p) behind the discontinuity greater 
than that ahead. If we have the initial functions 


ρ. χΧΡῸ 
ρ» χέεθ 


1(χ)Ξ 
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and 


ὌΝ οιΞο(ρι), x>0 
οχΞο(ρ)), x<0 


with c,>c,, then breaking occurs immediately. This is shown in Fig. 2.3 
for the case c’(p)>0, p,>p,. The multivalued region starts right at the 
origin and is bounded by the characteristics x=c,t and x=cy,t; the 
boundary is no longer a cusped envelope since F and its derivatives are not 
continuous. Nevertheless, the result may be considered as the limit of a 
series of smoothed-out steps, and the breaking point moves closer to the 
origin as the initial profile approaches the discontinuous step. 

On the other hand, if the initial step function is expansive with c,<c,, 
there is a perfectly good continuous solution. It may be obtained as the 
limit of (2.5) and (2.6) in which all the values of F between c, and c, are 
taken on characteristics through the origin §=0. This corresponds to a fan 
of characteristics in the (x,/) plane as in Fig. 2.4. Each member of the fan 
has a different slope F but the same ξ. The function F is a step function 
but we use all the values of F between c, and c, on the face of the step and 
take them all to correspond to =0. In the fan, the solution (2.5), (2.6) then 
reads 


c=F, x= Ft, for c,< F<c,, 


Fig. 2.3. Centered compression wave with overlap. 
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t 


Fig. 2.4. Centered expansion wave. 


and by elimination of F we have the simple explicit solution for c: 
x x 
=—, ὃγς. -- « γι 
ἔσο 7 Ὁ 1 


The complete solution for c is 


x 
1» (Ὶ «. ἘΞ τ ἢ 
ΐ 
x 
c= os <> ξ δὴ (2.9) 
x 
Co; ΡῈ <C>. 


The relation c=c(p) can be solved to determine p. For the compressive 
step, c,>c,, the fan in the (x,f) plane is reversed to produce the overlap 
shown in Fig. 2.3. 

In most physical problems where this theory arises, p(x,7) is just the 
density of some medium and is inherently single-valued. Therefore when 
breaking occurs (2.2) must cease to be valid as a description of the physical 
problem. Even in cases such as water waves where a multivalued solution 
for the height of the surface could at least be interpreted, it is still found 
that (2.2) is inadequate to describe the process. Thus the situation is that 
some assumption or approximate relation in the formulation leading to 
(2.2) is no longer valid. In principle one must return to the physics of the 
problem, see what went wrong, and formulate an improved theory. How- 
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ever, it turns out, as we shall see, that the foregoing solution can be saved 
by allowing discontinuities into the solution; there is then a single-valued 
solution with a simple jump discontinuity to replace the multivalued 
continuous solution. This requires some mathematical extension of what 
we mean by a “solution” to (2.2), since strictly speaking the derivatives of p 
will not exist at a discontinuity. It can be done through the concept of a 
“weak solution.” But it is important to appreciate that the real issue is not 
just a mathematical question of extending the solution of (2.2). The 
breakdown of the continuous solution is associated with the breakdown of 
some approximate relation in the physics, and the two aspects must be 
considered together. It is found, for example, that there are several possible 
families of discontinuous solutions, all satisfactory mathematically; the 
nonuniqueness can be resolved only by appeal to the physics. 

Clearly then, we cannot proceed further without discussion of some 
physical problems. The prototype is the nonlinear theory of waves in a gas 
and the formation of shock waves. When viscosity and heat conduction are 
ignored, the equations of gas dynamics have breaking solutions similar to 
the preceding ones. As the gradients become steep, just before breaking, 
the effects of viscosity and heat conduction are no longer negligible. These 
effects can be included to give an improved theory and waves no longer 
break in that theory. There is a thin region, a shock wave, in which 
viscosity and heat conduction are crucially important; outside the shock 
wave, viscosity and heat conduction may still be neglected. The flow 
variables change rapidly in the shock. This shock region is idealized into a 
discontinuity in the “extended” inviscid theory, and only shock conditions 
relating the jumps of the flow variables across the discontinuity need to be 
added to the inviscid theory. 

We will study all these various aspects in detail. However, gas dynam- 
ics is not the simplest example, since it involves higher order equations, 
and we shall discuss the essential ideas first in the context of the simpler 
first order problems. It should be remembered, though, that these ideas 
were developed for gas dynamics, and we are reversing the chronological 
order. The basic ideas were elucidated by Poisson (1807), Stokes (1848), 
Riemann (1858), Earnshaw (1858), Rankine (1870), Hugoniot (1889), 
Rayleigh (1910), Taylor (1910)—a most impressive list. The time required 
indicates that putting the different aspects together was quite a compli- 
cated affair. 


2.2 Kinematic Waves 


In many problems of wave propagation there is a continuous distribu- 
tion of either material or some state of the medium, and (for a one 
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dimensional problem) we can define a density p(x,?f) per unit length and a 
flux g(x,?) per unit time. We can then define a flow velocity v(x, 7) by 


q 
vo=-. 
p 


Assuming that the material (or state) is conserved, we can stipulate that the 
rate of change of the total amount of it in any section x, >x >x, must be 
balanced by the net inflow across x, and x,. That is, 


d * 
< f o(x,t) dx + q(x,,t) -- φία;,,.) =0. (2.10) 
x2 


If ρί(χ, ἢ has continuous derivatives, we may take the limit as x, x, and 
obtain the conservation equation 


dp 0g | 

~— + ~— =0. 2.11 

ot Ox ( ) 
The simplest wave problems arise when it is reasonable, on either theoreti- 
cal or empirical grounds, to postulate (in a first approximation!) a func- 
tional relation between q and p. If this is written as 


q=Q(p), (2.12) 
(2.11) and (2.12) form a complete system. On substitution we have 
p,+¢(p)p, =0 (2.13) 
where 
c(p)=Q'(p). (2.14) 


This leads to our (2.2) and a typical solution is given by (2.5) to (2.6). The 
breaking requires us to reconsider both the mathematical assumption that 
p and g have derivatives and the physical assumption that g=Q(p) is a 
good approximation. To fix ideas for the further development of the theory 
some specific examples are noted briefly here. We shall return to them in 
Chapter 3 for a more detailed discussion after the theoretical ideas are 
complete. 

An amusing case (which is also important) concerns traffic flow. It is 
reasonable to suppose that some essential features of fairly heavy traffic 
flow may be obtained by treating a stream of traffic as a continuum with 
an observable density p(x,), equal to the number of cars per unit length, 
and a flow q(x,?), equal to the number of cars crossing the position x per 
unit time. For a stretch of highway with no entries or exits, cars are 
conserved! So we stipulate (2.10). For traffic it also seems reasonable to 
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argue that the traffic flow ῳ is determined primarily by the local density p 
and to propose (2.12) as a first approximation. Such functional relations 
have been studied and documented to some extent by traffic engineers. We 
can then apply the theory. But it is clear in this case that when breaking 
occurs there is no lack of possible explanations for some breakdown in the 
formulation. Certainly the assumption g= Q(p) 15 a very simplified view of 
a very complicated phenomenon. For example, if the density is changing 
rapidly (as it is near breaking), one expects the drivers to react to more 
than the local density and one also expects that there will be a time lag 
before they respond adequately to the changing conditions. One might also 
question the continuum assumption itself. 

Another example is flood waves in long rivers. Here p is replaced by 
the cross-sectional area of the channel, 4, and this varies with x and ¢ as 
the level of the river rises. If g is the volume flux across the section, then 
(2.10) between A and g expresses the conservation of water. Although the 
fluid flow is extremely complicated, it seems reasonable to start with a 
functional relation g=Q(A) as a first approximation to express the in- 
crease in flow as the level rises. Such relations have been plotted from 
empirical observations on various rivers. But it is again clear that this 
assumption is an oversimplification which may well have to be corrected if 
troubles arise in the theory. 

A similar example, proposed and studied extensively by Nye (1960), is 
the example of glacier flow. The flow velocity is expected to increase with 
the thickness of the ice, and it seems reasonable to assume a functional 
dependence between the two. 

In chromatography and in similar exchange processes studied in 
problems of chemical engineering, the same theory arises. The formulation 
is a little more complicated. The situation is that a fluid carrying dissolved 
substances or particles or ions flows through a fixed bed and the material 
being carried is partially adsorbed on the fixed solid material in the bed. 
The fluid flow is idealized to have a constant velocity V. Then if p, is the 
density of the material carried in the fluid, and p, is the density deposited 
on the solid, 


P= P+ Ps, q = Voy. 
Hence the conservation equation (2.11) reads 
0 0 = 
5; (Opt Ps) + a (Vey) Ξ0. 


A second relation concerns the rate of deposition on the solid bed. The 
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exchange equation 


dp, 
ry = k,(A — p,) Pp— k,p,(B — pr) 


is apparently the simplest equation with the required properties. The first 
term represents deposition from the fluid to the solid at a rate proportional 
to the amount in the fluid, but limited by the amount already on the solid 
up to a capacity A. The second term is the reverse transfer from the solid 
to the fluid. (In some processes, the second term 15 just proportional to p,; 
this is the limit Boo, k,B finite.) In equilibrium, the right hand side of 
the equation vanishes and p, is a definite function of p,. In slowly varying 
conditions, with relatively large reaction rates k, and k,, we may take a 
first approximation in which the right hand side still vanishes (“quasi- 
equilibrium”) and we have 
es ae 
Pe OTB + (ke, a) oy 
Thus p, is a function of p,; hence q is a function of p. When changes 
become rapid, just before breaking, the term dp,/d¢ in the rate equation 
can no longer be neglected. 

As a different type of example, the concept of group velocity can be 
fitted into this general scheme. In linear dispersive waves, as already noted 
following (1.26), there are oscillatory solutions with a local wave number 
k(x,t) and a local frequency w(x,t). Thus k is the density of the waves— 
the number of wave crests per unit length—and w is the flux—number of 
wave crests crossing the position x per unit time. If we expect that wave 
crests will be conserved in the propagation, we have, in differential form, 
the conservation equation 


ok , Ow _ 
Or ΤΣ Ox 


In addition, k and w are related by the dispersion relation 


w=w(k). 
Hence 


oe (k) 


We have a wave propagation for the variations of the local wave number 
of the “carrier” wavetrain, and the propagation velocity is dw/dk. This is 
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the group velocity. These ideas will be considered in full detail in the later 
discussion of dispersive waves. 

The wave problems listed here depend primarily on the conservation 
equation (2.11), and for this reason they were given the name kinematic 
waves (Lighthill and Whitham, 1955) in contrast to the usual acoustic or 
elastic waves which depend strongly on how the acceleration is determined 
through the laws of dynamics. 

After this review of some of the physical problems, we return to the 
study of breaking and shock waves in order to complete the theory. 
Further details of the physical problems are pursued in Chapter 3. 


2.3 Shock Waves 


When breaking occurs we question the assumption g= Q(p) in (2.12) 
and also the differentiability of p and qg in (2.11). But, provided the 
continuum assumption is adequate, we still insist on the conservation 
equation (2.10). 

Consider first the mathematical question of whether discontinuities 
are possible. Certainly a simple jump discontinuity in p and in q is feasible 
as far as (2.10) is concerned; all the expressions in (2.10) have a meaning. 
Does (2.10) provide any restriction? To answer this, suppose there is a 
᾿ discontinuity at x=5s() and that x, and x, are chosen so that x, >s(t) 
>xX,. Suppose p and q and their first derivatives are continuous in x, >x 
> s(t) and in s(t)>x >x,, and have finite limits as x s(t) from above and 
below. Then (2.10) may be written 


ΧῚ 


_d ps d 
dxmt)—art=A [ς- eluthdet τ [ρα ἡ ἀν 


s(t) x 
=p(s~,t)5—p(s*,t)3+ f p,(x,t)dx+ [ " p, (x,t) dx, 
x2 s(t) 


where p(s” ,¢), p(s*,t) are the value of ρίχ,) as x—»s(t) from below and 
above, respectively, and s=ds/dt. Since p, is bounded in each of the 
intervals separately, the integrals tend to zero in the limit as x,-s*, 
χη 5. Therefore 


4(5.,1)ὴ -- φί 5 ",η)Ξ ἰρ(57,ὴ —p(s*,t) }3. 


A conventional notation is to use ἃ subscript 1 for the values ahead of the 
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shock and a subscript 2 for values behind. Then if U is the shock velocity, 
δ, 


92— 9, = U(p2— p)). (2.15) 
The condition may also be written in the form 
— U[p]+[4q]=9, (2.16) 


where the brackets indicate the jump in the quantity. This form gives a 
nice correspondence between the shock condition and the differential 
equation (2.11), the correspondence being 
9 
-- «»- -- : 2.17 
we-UL Sof | (2.17) 
We can now extend our solutions of (2.10) to allow such discontinui- 
ties. In any continuous part of the solution, (2.11) will still be satisfied and 
the assumption (2.12) may be retained. Since g= Q(p) in the continuous 
parts, we have g,= Q(p,) and g, = Q(p,) on the two sides of any shock, and 
the shock condition (2.15) may be written 


U= Q(e2) -- Q(1) . (2.18) 
P2~ Py 
The problem then reduces to fitting shock discontinuities into the solution 
(2.5), (2.6) in such a way that (2.18) is satisfied and multivalued solutions 
are avoided. 
The simplest case is the problem 


p=p,, c=c(p,)=c, x>0, a, 
P=p,, c=c(p,)=c, x<O, 


with c,>c,. The breaking solution was indicated in Fig. 2.3. Now a 


single-valued solution is possible which is just a shock moving with velocity 
(2.18): 


p=p, x>Ut, 
P=p, x<Ut. 
This is represented schematically in Fig. 2.5. 


A popular way to derive the shock condition is to view this particular 
solution from a frame of reference in which the shock is at rest, as shown 
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U 

P= Po P=P, 
q=q2=Q(po)| q=q, = Q(p)) 
-----.....-..» a 


Fig. 2.5. Flow quantities for moving shock. 


PoP Pah 
Q=d2-Up2 |] q=q, -Up, 
-- Ὁ» -.- “ἅ» 


Fig. 2.6. Flow quantities relative to stationary shock. 


in Fig. 2.6. The relative flows become q, — Up, and 4) -- Up,. The conserva- 
tion law may be stated immediately in the form 


41-- Up, =%— Up, 


and (2.15) follows. 

Before proceeding with the general problem of shock fitting, we 
consider the alternative view that the differential equation (2.11) is 
adequate but that the assumed relation (2.12) is insufficient. 


2.4 Shock Structure 


As a particular case, we need to find and examine a more accurate 
description of the simple discontinuous solution represented in Fig. 2.5. 
This is the problem of finding the “shock structure.” 

In many problems of kinematic waves, it would be a better approxi- 
mation to suppose that q is a function of the density gradient p, as well as 
p. A simple assumption is to take 


q=Q(p)— »p,, (2.19) 


where ν is a constant. In traffic flow, for example, we may argue that 
drivers will reduce their speed to account for an increasing density ahead, 
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and conversely. This argument would propose a positive value for ν, and 
we see below that the sign is important. If ν is small, in some suitable 
dimensionless measure, (2.12) is a good approximation provided p, 15 not 
relatively large. At breaking, p, becomes large and the correction term 
becomes crucial, however small » may be. Now in this formulation, 
consider continuous solutions. From (2.11) and (2.19), they satisfy 


p,+c(p)p,=vp,,, ¢(p) = Q'(p). (2.20) 


The term c(p)p, in (2.20) leads to steepening and breaking. On the 
other hand, the term vp,, introduces diffusion typical of the heat equation 


P, = PPx x: 


For the heat equation, the solution of the initial step function problem 


P=P), x >0, | 1=0 


p=p, x<0, 
1S 
-- χ νάνι 
p=p,+ a” f eS dt 
Va “-ὦ 


This represents a smoothed-out step approaching values p,, p, as x > +0, 
and with slope decreasing like (vt)~'/?. The two opposite tendencies of 
nonlinear steepening and diffusion are combined in (2.20). The signifi- 
cance of y >0 can be seen from the heat equation; solutions are unstable if 
ν«0. 

We now look within the framework of this more accurate theory for 
the solution to replace the one shown in Fig. 2.5. One obvious idea is to 
look for a steady profile solution in which 


p=p(X), X=x— Ut, 
where U is a constant still to be determined. Then from (2.20), 


{c(p) -- U} py = vpyy. 
Integrating once, we have 


Q(p)— Up+A=npy, (2.21) 


where A is a constant of integration. An implicit relation for p(X) is 
obtained in the form 


X Ν᾿ dp 
Ξ Ο(ρ)--σὕρτ A’ ae 


34 WAVES AND FIRST ORDER EQUATIONS Chap. 2 


but the qualitative behavior is more readily seen directly from (2.21). We 
are interested in the possibility of a solution which tends to constant states 
p—p, as X->+ 00, p>p, as X>— oo. If such a solution exists with py—>0 
as X-»+ 00, the arbitrary parameters U,A must satisfy 


Ο(ρι) — Up, + A= Q(p2) — Up, + A =0. 
In particular, 


_ 262) ~ Qo). 


2.23 
βρη Py ( 


U 


In such a solution, the relation between the velocity U and the two states 
at +0co is exactly the same as in the shock condition! 

The values p,, p, are zeros of Q(p)— Up+ A, and in general they are 
simple zeros. As p—>p, OF p, in (2.22), the integral diverges and X— + oo as 
required. If Ο(ρ)-- Up+ A «0 between the two zeros, and if ν is positive, 
we have p, <0 and the solution is as shown in Fig. 2.7 with p increasing 
monotonically from p, at +00 to p, at —oo. If Q(p)— Up+A>0 and 
vy >0O, the solution increases from p, at — oo to p, at +00. It is clear from 
(2.21) that if p,,p, are kept fixed (so that U,A are fixed), a change in ν can 
be absorbed by a change in the X scale. As v->0, the profile in Fig. 2.7 1s 
compressed in the X direction and tends in the limit to a step function 
increasing p from p, to p, and traveling with the velocity given by (2.23). 
This is exactly the discontinuous shock solution seen in Fig. 2.5. For small 
nonzero ν the shock is a rapid but continuous increase taking place over a 
narrow region. The breaking due to the nonlinearity is balanced by the 
diffusion in this narrow region to give a steady profile. 

One very important point is the sign of the change in p. A continuous 
wave carrying an increase of p will break forward and require a shock with 
P> >, if c’(p)>0; it will break backward and require a shock with p,<p, 
if c’(p) <0. The shock structure given by (2.21) must agree. As remarked 
above, v is always positive for stability, so the direction of increase of p 
depends on the sign of Ο(ρ) -- Up+ A between the two zeros p, and p,. But 


Fig. 2.7. Shock structure. 
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c’(p)= Ο΄ (p). Hence when c’(p)>0, O(p)— Up+ A <0 between zeros and 
the solution is as seen in Fig. 2.7 with p, >, as required. If c’(p)<0, the 
step is reversed and p,<p,. The breaking argument and the shock struc- 
ture agree. 


In the special case of a quadratic expression for Q(p), taken as 


O(p)=ap*+ Bo+y, (2.24) 


the integral in (2.22) is easily evaluated. The sign of a determines the sign 
of c’(p)= Q”(p) and we consider a >0, for definiteness. We may write 


| Q— Upt+ A= —a(p—p,)(92—p), 
where — | 


U=B+a(p,+p,),  A=ap,p.—y. 
Then (2.22) becomes 


, d Ν 
(oe Ξ- οὐ» "Ὁ ΑΝ (2.25) 


α(ρ--ρι)(ρ,--6) α(ρ,--ρ) ΡΊΡι, 


= | 


As X00, p—p, exponentially, and as X--— οὐ, p—p, exponentially. 
There is no precise thickness to the transition region, but we can introduce 
various measures of the scale, such as the length over which 90% of the 
change occurs or (p,—,) divided by the maximum slope |p,|. Clearly all 
such measures of thickness are proportional to 


(p21) ee 


If this is small compared with other typical lengths in the problem, the 
rapid shock transition is satisfactorily approximated by a discontinuity. We 
confirm that the thickness tends to zero as v0 for fixed p,,p,, but it also 
should be noted that sufficiently: weak shocks with (p,—p,)/p,-0 ulti- 
mately become thick for fixed », however small. For weak shocks Q(p) can 
always be approximated by a suitable quadratic over the range p, to ρ5, so 
that (2.25) applies. Even for moderately strong shocks it is a good overall 
approximation to the shape. 

The shock structure is only one special solution of (2.20), but from it 
we might expect in general that when »v-—>0 in some suitable nondimen- 
sional form, solutions of (2.20) tend to solutions of 


p,+c(p)p, =0 
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together with discontinuous shocks satisfying 
Ο(ρ) — Ο(ρι) 
CE 
βρη Py 

This is true when the solutions are compared at fixed (χ, ἢ with v0. 
However, the fact that the shock transition becomes very wide as (p, — p,)/ 
p,—0, for fixed ν, means that in any problem where the shocks ultimately 
tend to zero strength as t—»co, there may be some final stage with 
extremely weak shocks when the discontinuous theory will be invalid. This 
is often a very uninteresting stage, since the shocks must be very weak. 

Otherwise, we can say that the two alternative ways of improving on 
the unacceptable multivalued solutions agree. The use of discontinuous 
shocks is the easier analytically and can be carried further in more 
complicated problems. 

Confirmation in more detail would require some explicit solutions of 
(2.20) which involve shocks of varying strength. Although solutions are not 
known for a general Q(p), it turns out that (2.20) can be solved explicitly 
when Q(p) is once again a quadratic in p. If (2.20) is multiplied by c’(p), it 
may be written 


ς, Ἔ ες, = να΄ (ρ)ρ,χ 
= vc,,.— vce" (ρ)ρξ. (2.27) 
If Q(p) is quadratic, c(p) is linear in p, then c”(p)=0 and we have 
C, + CC, = VC, (2.28) 


This is Burgers’ equation and it can be solved explicitly. The main results 
are given in Chapter 4. For the present, we accept the evidence for 
pursuing discontinuous solutions of (2.2) bearing in mind that for ex- 
tremely weak shocks it will not be appropriate. For the extremely weak 
shocks, Q(p) can be approximated by a quadratic and Burgers’ equation 
can be used. 

The arguments in this section depend strongly on »>0. As noted 
previously, this is required for stability of the problem. Interesting cases of 
instability do occur, however, in traffic flow and flood waves. They are 
discussed in Chapter 3. 


2.5 Weak Shock Waves 


In a number of situations the shocks are weak in that (p,—1)/p, is 
small, but they are not so extremely weak that they may no longer be 
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treated as discontinuities. It is useful to note some approximations for such 
cases. 
The shock velocity 


_ 2(02)— O01) 


U 
βρη Py 
tends to the characteristic velocity 
- 42 
c(p)= 
(p) ip 


in the limit as the shock strength (p,—p,)/p,—-0. For weak shocks the 
expression for the shock velocity U may be expanded in a Taylor series in 
(02 — P1)/p, as 


ἢ ] " 2 
U=Q'(p,) + 2 (ρ2-- ρ4)0“ (ρι) + O(p.— 1). 
The propagation velocity c(p,)= Q’(p,) may also be expanded as 


c(p2) =¢(p,) + (ρ5--ρ4)0“ (ρι) + O(p.—p,) 
Therefore 


1 
U= 5 (c+ ¢) + Ο(ρ7-- ρα)", (2.29) 


where c,=c(p,) and c,=c(p,). To this approximation, the shock velocity is 
the mean of the characteristic velocities on the two sides of it. In the (x, ἢ) 
plane the shock curve bisects the angle between the characteristics which 
meet on the shock. This property is useful for sketching in the shocks, but 
it also simplifies the analytic determination of shock positions. Clearly the 
relation is exact when Q(p) is a quadratic. 


2.6 Breaking Condition 


A continuous wave breaks and requires a shock if and only if the 
propagation velocity c decreases as x increases. Therefore when the shock 
is included we have 


c,>U>«¢,, (2.30) 


where all velocities are measured positive in the direction of x increasing 


38 WAVES AND FIRST ORDER EQUATIONS Chap.2 


and the subscript 1 refers to the value of c just ahead of the shock (i.e., 
greater value of x) and the subscript 2 refers to the value of c just behind 
the shock. A shock produces an increase in c, and it is supersonic viewed 
from ahead and subsonic viewed from behind. As regards the jump 
condition alone, it would be feasible to fit in discontinuities with c,<c,. 
However, shocks with c,<c, could never be formed from a continuous 
wave and they are never required; they are excluded from consideration 
for this reason. | 3 

One question about this argument is the point that the solution 
represented in Fig. 2.5 might be set up with c,<c, (by some complicated 
but probably highly unrealistic device). Of course we have already noted in 
(2.9) and in Fig. 2.6 a satisfactory continuous solution for such initial 
conditions. Still, to be particularly awkward, one might insist that Fig. 2.5 
gives an alternative solution. The answer is that this proposed solution is 
unstable. That is, small perturbations would change the flow into some- 
thing quite different—the expansion fan solution of (2.9). This is a 
“disintegration argument” which is complementary to the “formation 
argument.” The instability will not be considered in detail in this chapter 
since the formation argument is convincing and unambiguous. For higher 
order equations the shock formation becomes harder to study and the 
instability arguments sometimes give an easier method to decide whether a 
particular shock satisfying the shock conditions is really possible. 

For gas dynamic shocks, the inequality corresponding to (2.30) 1s 
equivalent to the condition that the entropy of the gas increases as the gas 
passes through the shock. The entropy condition was the first argument for 
the irreversibility of shock waves, that is, that the shock transition goes only 
one way. However, conditions like (2.30) are more general. In some 
problems there is no obvious counterpart to entropy; in others, such as 
magnetogasdynamics, the entropy condition does not rule out some inad- 
missible shocks. by 

An alternative view of these criteria is that any acceptable discon- 
tinuous shock must have a satisfactory shock structure when described by 
more accurate equations. This is a more satisfactory point of view, since it 
appeals to a more realistic description of the phenomenon. However, the 
analysis may become prohibitive and one often resorts to the indirect 
arguments in the framework of the simpler theory. 

This alternative approach was checked in the discussion of shock 
structure in Section 2.4. When c’(p) >0 we found only a shock structure for 
0, >; since c’(p)>0, this is equivalent to c,>c,. When c’(p)<0, we 
found p,<p,, but the change in sign of c’(p) means that c,>c,. Since 
c(p)= Q(p), the shock velocity lies between the values c, and c, by Rolle’s 
theorem. 
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2.7 Note on Conservation Laws and Weak Solutions 


Mathematically, the composite solution composed of continuously 
differentiable parts satisfying 


dp ὃ0(ρ) 
Ot ox 


0 (2.31) 


together with jump discontinuities satisfying 


—U[p]+[Q(p)]=0 (2.32) 


can be considered a weak solution of (2.31). Briefly, the idea is as follows. 
Associate with (2.31) the equation 


= ! ἰρφ, Ὁ Θ(ρ)φ,) dx dt=0, (2.33) 


where R is an arbitrary rectangle in the (χ,2) plane, and ¢ is an arbitrary 
“test” function with continuous first derivatives in R and =O on the 
boundary of R. If p and Q(p) are continuously differentiable, (2.31) and 
(2.33) are equivalent. On the one hand, if (2.31) is multiplied by φ and 
integrated over R, we may deduce (2.33) after integration by parts. On the 
other hand, integration by parts on (2.33) leads to 


dp 90(ρ) 
Shae -- Ἰφάχάτο, 


and, since this must hold for all arbitrary continuous φ, (2.31) follows. 
However, (2.33) allows more general possibilities, since the admissible 
functions p(x,t) need not have derivatives. Functions p(x,?) which satisfy 
(2.33) for all test functions ¢ are called “weak solutions” of (2.31). 

We now investigate what this extended meaning of solution has 
achieved. Consider the possibility of a weak solution p(x,/), that is, one 
satisfying (2.33), which is continuously differentiable in two parts R, and 
R, of R, but with a simple jump discontinuity across the dividing | 
boundary, S, between R, and R,. We may integrate by parts in each of the 
separate regions R,, R,, and deduce from (2.33) that 


Slee Joarars ff [a+ Ge | gata 


+ f {[e]/+[Q(e)]m} ods =0, 
S 
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where (/,m) is the normal to S and [p], [Q(p)] denote the jumps across S. 
The line integral on S consists of the two contributions from the boundary 
terms of R, and R, obtained in the integration by parts. Since this equation 
must hold for all test functions φ, we deduce that (2.31) holds inside each 
of the regions R, and R,, but in addition we deduce 


[p]/+[Q(e)]m=0 on δ. 


This is the shock condition (2.32), since U= —//m. Thus weak solutions of 
this type would satisfy (2.31) at points of continuity and allow jump 
discontinuities satisfying the shock condition. Just what we want! 

At first sight, the weak solution concept appears to bypass the more 
involved and less precise discussion of the real physical processes. But it is 
not really so. Corresponding to the differential equation 


dp 


dp 
Or + c(p) a = () 


there are an infinite number of conservation equations 


af(e) ὃ5(ρ) 
Or ax 


Any choice which satisfies 


0. (2.34) 


5΄ (ρ) =f'(p)c(p) | (2.35) 


will do. For differentiable functions p(x, 7), these are all equivalent. How- 
ever, their integrated forms are not equivalent and lead to different jump 
conditions. The weak solution of (2.34) will require the shock condition 


—U[f(e)]+[2(e)]=0; (2.36) 


different choices of f and g lead to different relations between p,, p,, and 
U. Therefore a discussion of the physical processes is still necessary in 
order to pick out which weak solution is relevant to the particular physical 
problem at hand. 

From the differential equation 2.34 we can propose a candidate for a 
conservation equation in integrated form: 


5 J, Hoax +90) ]n,=0 (2.37) 


But whether this holds for nondifferentiable p can be decided only by 
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returning to the original formulation of the problem. In Section 2.2 we 
argued in the correct order: first (2.10) then (2.11). The reverse order, 
going from an equivalent partial differential equation to an integrated 
form, introduces the lack of uniqueness. 

If (2.37) is the true conservation equation, then (2.36) may be deduced 
as the shock condition by the same argument that was used in Section 2.3. 
Thus the correct choice of weak solution is made on the basis of which 
quantities are really conserved across the shock. In view of the lack of 
uniqueness and possible confusion, it is felt that the weak solution concept 
is not particularly valuable in this context and it is better to. stress that 
physical problems are first formulated in the basic integrated forms from 
which both the partial differential equations and the appropriate jump 
conditions follow. 

A looser form of the weak solution idea is sometimes useful in a 
preliminary look at a problem. If, for example, we ask whether (2.34) might 
admit moving discontinuities as part of the solution, we might try 


Τρ) =fo(x) H(x—- Ut) +f, 
&(p) =89(x) Η(χ -- Ut) +2, 


where H(x) is the Heaviside step function and ἢ, g, are continuous 
functions. On substitution in the equation we obtain 6 function terms 


(— Ufy+ 8)8(x— Ut) 
plus less singular terms. We deduce that 
" Uf + 80 a 0, 


and this is the shock condition (2.36), since fy=[f], 89=[g]. This does not 
avoid the lack of uniqueness, of course, and it also uses ὃ functions in a 
slightly dubious way. The use of 6 functions in nonlinear problems usually 
is excluded because there is no satisfactory meaning to powers and 
products of such generalized functions; we have retained an artificial 
linearity by expressing f(p) and g(p) separately, rather than using a single 
expression for p. Of course, the justification of the 6 function argument is 
via the weak solution. 

In contrast, consider the same question of the possibility of moving 
discontinuities for (2.20) written in the form 


dp ὃ09(ρ) ὃρ 


δὲ ox aye: 
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If p and Q(p) are expressed in terms of H(x— Ut), the term 02p/ 0x? will 
have a term [p]6’(x— Ur) and there is no other term as singular as 
δα — Ut) to balance it. We conclude that [p]=0 and that discontinuities 
are not possible. This is clearly a useful tool for a preliminary assessment. 


2.8 Shock Fitting; Quadratic Ο(ρ) 


After discussion of these various points of view we now turn to the 
analytic problem of fitting discontinuous shocks satisfying 


= O(p2) — Ο(ρι) 


U 2.38 
— (2.38) 
into the continuous solution 
p=f(§), 
(2.39) 
x=§+F(E)t. 


Any multivalued part of the wave profile must be replaced by an 
appropriate discontinuity, as shown in Fig. 2.8.* The correct position for 
the discontinuity may be determined by the followng ingenious argument. 
Both the multivalued curve and the discontinuous curve satisfy conserva- 
tion. Therefore {pdx under each curve must be the same; hence the 
discontinuity must cut off lobes of equal area, shown shaded in Fig. 2.8. 


Po 


s(t) x 
Fig. 2.8. Equal area construction for the position of the shock in a breaking wave. 


*The figure is drawn for the case c’(p)>0 but all the formulas in this section are correct for 
either case. 
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This determination, although quite general, is not in a convenient 
form for analytic work. The general case gets complicated and it is 
worthwhile to do a special case first. The special case is again a quadratic 
expression for Q(p). This includes the case of weak disturbances about a 
value p=, since Q(p) can then be approximated by 


= Ο(ρο) + O'(Po)(p~ po) + ΣΟ" (0) (P— po)” 


and for this reason it has considerable generality. 
We consider 


O(p) =ap*+ Bp+y. 
Then 


c(p) =Q'(p) =2ap+ B 
and the shock velocity (2.38) becomes 


U= 5 (e+e), 


where c,=c(p,), C.=c(p,). 
The simplicity of this case is that the whole problem can be written in 
terms of c. The continuous solution is 


= ὁ... (2.40) 
x=£+ F(£)t, 
and shocks must be fitted in such that 
ὑπ Σ(ο τω) Σ { F(E,) + FG) }; (2.41) 


where ξὶ and ἔξ; are the values of ξ on the two sides of the shock. Since p 
and c are linearly related, the conservation of p implies conservation οἵ-ς; 
that is, f{ cdx is conserved in the solution. Therefore for this special case 
the shock construction for the (,x) curve in Fig. 2.8 applies equally well to 
the (c,x) curve. 

It is convenient to note for future reference that this solution in terms 
of c solves the equation 


c, + cc, =0, (2.42) 


with weak solutions chosen to satisfy the conservation law 


c,+ (4c?) =0 (2.43) 
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F ς €=£5 


Eo Cit. Ee s (t) x 
α b 


Fig. 2.9. Equal area construction: (a) on the initial profile; (6) on the transformed breaking 
profile. 


so that the shock condition is 


2 1.2 - 
0, 50) ] 
ie ν Wile ee 2.44 
U a 5 (οι + 2) (2.44) 


Equation 2.42 is true for general Q(p), since it is c’(p) times ρ, + c(p)p, ΞΞ0; 
(2.44) is always a possible weak solution, but it is the correct choice only 
when Q(p) is quadratic or approximated by a quadratic since it is only in 
that case that the integrated form of (2.43) holds across discontinuities. 
The shock construction can now be combined with the continuous 
solution (2.40). Since we now work with c the awkward distinction between 
the two cases c’(p) 20 does not arise. According to (2.40) the solution at 
time ¢ is obtained from the initial profile c= F(€) by translating each point 
a distance F(é)¢ to the right, as shown in Fig. 2.9. The shock cuts out the 


ξ 2 SB €| x 


Fig. 2.10. The (x,#) diagram associated with the shock construction in Fig. 2.9. 
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part corresponding to ,>€>6,. If the discontinuity line in Fig. 2.95 is 
also mapped back as in Fig. 2.9a, it is a straight line chord between the 
points =, and €=§, on the curve F(é). Moreover, since areas are 
preserved under the mapping, the equal area property still holds in Fig. 
2.9a; the chord on the F curve cuts off lobes of equal area. The shock 
determination can then be described entirely on the fixed F(é) curve by 
drawing all the chords with the equal area property. The pairs €=é,, €=&, 
at the ends of each chord relate characteristics which meet on the shock. 
The (x,¢) plane is shown in Fig. 2.10. The equal area property can be 
written analytically as 


5 (FE) + FG) (Ei &) = [PO ab (2.45) 


since the left hand side is the area under the chord and the right hand side 
is the area under the F curve. If the shock is at x =5s(t) at time ¢, we also 
have 


s(t)=§,+ F(E,)t (2.46) 


s(t) =f + F(&)t (2.47) 


from the second of (2.40). The three equations (2.45)-(2.47) determine the 
three functions s(7), €,(¢), and £,(2). The determination of s(t) is implicit 
involving the two additional functions €,(t) and &,(t) which determine the 
characteristics meeting the shock at time ¢. The values of c on the two sides 
of the shock are c, = F(é,) and c, = F(&,); the values of p are obtained from 
ο. 

Since the shock determination (2.45)-(2.47) was obtained geometri- 
cally, it is interesting to check directly that it does indeed satisfy the shock 
condition (2.41). We may write this verification as an independent deriva- 
tion of the result in (2.45). We have to find three functions s(t), &,(2), &(0) 
which satisfy (2.46), (2.47), and 


(1) =5 (FE) + F(G)}- (2.48) 


(Dots denote ¢ derivatives.) From (2.46) and (2.47), we have 


coe ξι τ 
“τ FE) κ᾽ oe 
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and 
§(t) = {1+ 4F'(,) }&,+ F(&), 
s(t) = {1+ tF'(&) }& + F(é). 


If we take the mean of these last two expressions for $ in order to preserve 
symmetry, substitute for ¢ from (2.49), and then substitute in (2.48), we 
obtain 


(F’(&)8,+ F'(&)b) (8) + 5 (P(E) + FE) } (6-8) 


N|— 


ae F(é,)& = F(&,)&). 


This may be integrated to (2.45); the constant of integration may be 
dropped since the starting point of the shock, ¢, = £,, must be a solution. 

The expression (2.49) for the time can be used to follow the develop- 
ment of the shock. Since ¢>0, all the relevant chords in Fig. 2.9a must 
have negative slope. Since é, >&, by the choice of notation, F(£,) > F(é)); 
that is, c, >c, as we decided from the breaking condition. The earliest time 
for the shock corresponds to the steepest chord. This is the limit when the 
chord is tangent at the point of inflexion =, say. Then F(é,)= F(&,) so 
the shock starts with zero strength and the time is 


] 


7: 


This all fits with the conditions for the first point of breaking discussed in 
(2.8). For an F curve like Fig. 2.9a, the chords tend to the horizontal as 
too, with F(é,) — F(é,;) 0; hence c,— c,;->0 and the shock strength tends 
to zero as [> 0. 


Single Hump. 
To study the shock in detail, we suppose first that F(€) is equal to a 


constant cy outside the range O<£<L, and F(€)>cy in the range. Equation 
2.45 may be written as 


τ σφ + δ) -2ω) τ} [{π0 τ ὦ) δὲ 


As time goes on, é, increases and eventually exceeds L. At this stage 
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F(€,)= cy and the shock is moving into the constant region c=c,). The 
function ¢,(¢) can then be eliminated, for we have 


] L 
7 {F(&)—co} (Ei) =f (FE)—co}d& ἐπ 


Therefore 


5 (Fl) ~co}t= {{{π0 ~ οὐ a. 


At this stage, the shock position and the value of c just behind the shock 
are given by 


s(t) =§ + Ε(ξ,):, 


c=F(&), 


(2.50) 


where ξ, (1) satisfies 


5 ( F()—co}t= [τ ~ co} ἀξ 


As t->00, we have &,-0 and F(§,)—cy; hence the equation for ξ),([) 
takes the limiting form 
I 2 
2 { F(&)—¢o} t~A, 


where 
L 
A= f { F(£) — co} dé 


is the area of the hump above the undisturbed value cy. We have &,- 0, 
F(§,)~cy+ V2A /t . Therefore the asymptotic formulas for s(¢) and c in 
(2.50) are 


S—™Col + ν 24: 3 


2.51 
a (2.51) 
er | ak 


at the shock. The shock curve is asymptotically parabolic and the shock 
strength (c— c¢y)/C, tends to zero like t~'/?. 
The solution behind the shock is given by (2.40) with 0 « ξ « ξ,. Since 


48 WAVES AND FIRST ORDER EQUATIONS Chap.2 
ς 


Co = 


x 
Cot Cot + JS2At 


Fig. 2.11. The asymptotic triangular wave. 


€,0 as ¢->0, all the relevant values of € also tend to zero and the 
asymptotic form is 


c~ Cot <x <Cot + ναι. (2.52) 


The asymptotic solution and the corresponding (x, 1) diagram are shown in 
Fig. 2.11. Notice that the details of the initial distribution are lost; only 


L 
A= J { F(€)— co} ἀξ appears in the ultimate asymptotic behavior. 


N Wave. 


Other problems can be worked out in a similar way. One important 
case is when F(§) has a positive and a negative phase about an undisturbed 
value cp, as in Fig. 2.12. There are now two shocks, corresponding to the 
two compression phases at the front and at the back where F’(é)<0. The 
families of chords for each are shown in the figure. As too, the pair 
(€,,£,) for the front shock approach (0,00), whereas for the rear shock, 
(€,,€,) approach (— 00,0). Asymptotically the front shock is 


S~Cott+ V2At 


Fig. 2.12. Shock construction for an N wave. 


ee ee SS Se pe a Oe ee «πὸ “παν er emt ἀν ee ee ee ee eee ae αὐτὸς ee ee 


Cot+ /2At 


NZ 


Fig. 2.13. The asymptotic N wave. 
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1/24 
C—Co™ <a ite 


where A is the area of the F curve above c=cy. The rear shock has 


C=C γ΄" 
oc vel os 
t 


where B is the area below c=cy. The solution between the shocks is again 
asymptotically 


and the jump of c is 


3 


cme, Cot - V2Bt <x <cot+ ναι. (2.53) 


The asymptotic form and the (x,f) diagram are shown in Fig. 2.13. 
Because of the shape of the wave profile, it is known as the N wave. 
Periodic Wave. 


Another interesting problem is that of an initial distribution 
2 
c= F(t) =c,+ asin a (2.54) 
In this case, the shock equations (2.45)-(2.47) simplify considerably for all 


times 1. Consider one period 0<é<A as in Fig. 2.14. Relation 2.45 
becomes 7 


(€,—§) sin τ (ξι Ὁ ξ2) cos χ(ξι —§)= Α sin τ (ξ, -- ξ3) sin τ (€,+&), 


F 


O€, EX : 


Fig. 2.14. Shock construction for a periodic wave. 
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and the relevant choice is the trivial one 


+ 
ἐπ πιο ἘΣ 6: Gti. Gaetan. 


λ 
From the difference and sum of (2.46) and (2.47), we have 
js ξ 1 ξ, 
dasin Ὁ (ξ Ἐὰν ἢν 
S= Cot + Ἢ 


2 
respectively. The discontinuity in c at the shock 15 


2πξ; _ 2πἢ) 
x — asin x 


Ο,-- (1 = asin 


=2asin (€,—§). 


If we introduce 


ἐπέ, λέ, £,+6,=A, 


we have 


S=Cot+ =, (2.55) 


The shock has constant velocity cg and this result could have been deduced 
in advance from the symmetry of the problem. The shock starts with zero 
strength corresponding to @=0 at time t=A/2ma. It reaches a maximum 
strength of 2a/c, for 0=7/2, t=A/4a, and decays ultimately with 0-7, 
[—- 00, 


i dl (2.56) 
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- Οοἵ 
x 3X ae 
2 


2 2 
Fig. 2.15. Asymptotic form of a periodic wave. 


Ι! 
ΟΣ 


It is interesting that the final decay formula does not even depend 
explicitly on the amplitude a. However, the condition for its applicability is 
t>)/a. For any periodic F(g), sinusoidal or not, £, —§,—A as t->00; hence 
from (2.49) 


Co Co ~~ Cot 
Between successive shocks, the solution for c is linear in x with slope 1/t 
as before, and the asymptotic form of the entire profile is the sawtooth 
shown in Fig. 2.15. 


Confluence of Shocks. 


When a number of shocks are produced it is possible in general for 
one of them to overtake the shock ahead; they then combine and continue 
as a single shock. This is also described by our shock solution. Consider 
the F curve in Fig. 2.16. Two shocks are formed corresponding to the 
points of inflexion P and Q with families of equal area chords typified by 
P,P, and Q,Q,. As time goes on the points Q, and P, approach each other 
until the stage in Fig. 2.16b is reached where a common chord cuts off 
lobes of equal area for both humps. At this stage the characteristics 
corresponding to P, and Q, are the same, and therefore the shocks have 
just combined into one as shown in the (χ, ἢ) diagram Fig. 2.17. All the 
characteristics between ΟΣ and P; have now been absorbed by one or 
other of the shocks; a single shock proceeds using chords P,Q,’ as in Fig. 
2.16c, counting only total areas above and below the chord in the equal 
area construction. 


a 
a 


Fig. 2.16. Construction for merging shocks. 


F Qo 
Po 
Q 
Q, \P 
P| 
a ξ 
Q2 
/ 
ΘΙ 
P2 
p/ 
b ξ 
ΩΣ 
pi 
a ξ 


Fig. 2.17. The (x,t) diagram for merging shocks corresponding to Figs. 2.16. 
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2.9 Shock Fitting; General Q(p) 


For the general dependence of q on p, the shock determination can be 
put into an analytic form similar to (2.45)-(2.47). The complication is the 
nonlinear relation between c and p, so that the construction in Fig. 2.8 15 
correct for p but not for c. Accordingly, we must work with p. But if we 
then plot (p,x) curves similar to Fig. 2.9, the discontinuity line does not 
map into a straight chord because the translation is proportional to c and 
not to p. Thus the mapping back onto the initial curve is no particular 
advantage. 

However, we can proceed as follows. Introduce the function &(p), 
which is the inverse of 


p=f(&), 


and introduce also the function X(p,t), which is the inverse of the function 
p=p(x,t) in the multivalued solution. That is, we fix attention on a 
particular value of p and note where it is now, X(p,¢), and where it was 
initially, (9). From the equation for the characteristics we have 


X(p,t)=c(p)tt+&(p). (2.57) 


Consider the shock at s(t) and let p, and p, be the values ahead and 
behind the shock, respectively. The equal area construction in Fig. 2.8 may 
be written 


[-X(0.0) do (ρι -- ρ)5(). 


[This is true for either case c’(p)20. We always take p, to be the value 
ahead of the shock and p, to be the value behind. If c’(p) >0, then p, >; 
if c’(p) <0, then p,<,.] Hence from (2.57), 


fel) +€(0) } ἀρΞε (ρι-- ρῳ)5(1). 
Since c(p)= Q’(p), this may be written 
(4ι-- 42)4~ (ρι-- p2)3(1) =~ f" 80) ao. (2.58) 


The right hand side can be integrated by parts and rewritten as 


ξὶ 
πριξι Ἔρχξ Ἔ [ ρ(ξ) ἀξ. 
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The shock position s(t) is given by 


s(t) =&, + cy, 
(2.59) 
s(t) =& +56; 


these may be solved for s(t) and ¢ and substituted in (2.58). Finally, (2.58) 
becomes 


ξγ--ξ; 


gy 
{ (42-91) — (P2¢2— 11) } ε ράξ, (2.60) 
where p, 4, and c are all to be evaluated as functions of ξ through the 
relations 


p=f(§), @g=O(f(E)), c=O'(f(E)) (2.61) 


and subscripts indicate values for €=£, and €=&,. [This makes it a little 
clearer than using f(€) for p, F(§) for c and introducing a new symbol for q 
as a function of ¢.] Equations 2.59 and 2.60 give three relations for 5 ({), 
£,(t), (4. Again it may be verified directly by differentiation that the 
shock condition 


42) 
P2— P1 
is satisfied. When q is quadratic in p, it is easily verified that (2.60): reduces 
to (2.45). The problems like a single hump or an N wave can be analyzed 
as before and are qualitatively similar. The asymptotic formulas (2.51), 
(2.52), and (2.53) still apply with the modification that 


A= σ(ου) [φρο ἀξ, 


and B is changed similarly. The expressions for p may be deduced from 
those for c, since the disturbance is weak in the asymptotic limit and 
P— Py=(C—Co)/c'(pp) to first order. 


2.10 Note on Linearized Theory 


When disturbances are weak, nonlinear equations are often 
“linearized” by neglecting all but the first order powers of the perturba- 
tions. For weak disturbances with (c —cy)/cy<1, the equation 


C, +c, =0 


56 WAVES AND FIRST ORDER EQUATIONS Chap. 2 
would be linearized to 
C, + coc, = 9. 


As noted earlier the solution of this equation is c—cyg=/f(x— Cot). The 
breaking effect and the formation of shocks are completely absent, yet we 
see from Figs. 2.11, 2.13, and 2.15 that these become crucial after a 
sufficient time, however weak the initial disturbance may be. Thus it is clear 
from comparison of the answers that the linearized approximation cannot 
be uniformly valid as too. 

This may also be seen directly, by looking at the linear theory as the 
first term in a naive expansion in powers of a small parameter. Suppose ε 
measures the maximum initial value of (c — cy)/ cp, and a solution is sought 
in the form 


c=Cytec, (x,t) +€%,(x,t)+-°°. 


When this expansion is substituted in c,+ cc, =0 and coefficients of εἴ are 
equated to zero, we have a hierarchy of equations starting with 


Ci, + C0C1,=9, 
(γι Ἔ C02, = — C1C 1x: 
C3p Ἢ Cola, = — C2C1y — C1Cy- 
These are easily solved successively since at each step we have 
φ, + Cop, = B(x, 1), 


where ® is known from the previous step. If we introduce the characteristic 
coordinate y = x — Cot, this may be written 


dp 
aes =O(y+coft,t). 
( ot -ε ω τ ) 


Therefore 
t 
o= f O( y+cot,7) dr+V(y). 
| 0 
The initial condition on c may be written 


C=CyoteP(x) at +=0, 
and it is satisfied by 


cj=P(x), ¢,=0 (n>1) at t=0. 
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Hence the complementary functions V(y) are zero in the solutions for the 
C,, n> 1. The first three c, are found to be 


c= P(y), 


C7, = —tP(y)P’(y), 
ι2 2p’\r 
Cy= > (Ρ΄ΡΊ. 


It is clear that in general c, will contain a term of the form t”~'R (y). 
Therefore the successive terms in the assumed series for c are of order 
«"t”—', and the series is not uniformly valid as too. 

The failure of the linearized theory, brought out strongly in the 
solution for the higher order terms, is that it approximates the characteris- 
tics as lines x—c t=constant. The slight inclination of the true 
characteristic lines, relative to each other, accumulates to a large dis- 
placement as too. The correct solution may be written as a telescoping 
function: 


C=Cot+eP(x-—ct), 
=CyoteP(x—[coteP Jt), 


and so on. The naive perturbation expansion can then be obtained by an 
inadvisable use of Taylor series! ' 


2.11 Other Boundary Conditions; The Signaling Problem 


The solution for the initial value problem has been given in great 
detail. Other boundary value problems can be solved in similar fashion. It 
is clear from the characteristic form (2.4) that the solution is determined 
once the value of p is given on any curve that intersects each characteristic 
once. Such a boundary value provides the initial conditions for integrating 
the two ordinary equations in (2.4) along the characteristic through that 
point. In principle, this is repeated at each point of the boundary curve to 
build up the solution in the whole region covered by the characteristics 
through the boundary curve. If the curve intersects characteristics twice, as 
does curve ABC in Fig. 2.18, the data can only be posed on AB or BC, 
otherwise the integration starting from AB, say, will conflict with the data 
on arrival at BC. Since the characteristics may depend on the solution, the 
region covered and the admissibility of the boundary curve cannot always 
be decided in advance. 
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ς 


Fig. 2.18. Characteristics and initial data. 


A standard boundary value problem is the so-called signaling problem 
for which 
ρΞΞρῃ for x >0, ἐξξ0, 


p=2(t) for t>0, x =0, 


and the solution is required in x >0, t>0. Of course, this problem only 
arises in the case c= Q'(p)>0. The (x,t) diagram is shown in Fig. 2.19. 
Characteristics start from the positive x axis and the positive ¢ axis. Those 
from the ¢ axis have p=pp, C=C(Po)=Cp and are straight lines x — cof 
=constant. Thus they predict 


ρξξρ» C=Cgo MxX>Col. (2.62) 


For the characteristics starting from the ¢ axis, let a typical one start at 
t=7r. Then 
p=g8(t), 


2.63 
x=G(r)(t-7), ( ) 


X= § P*Po ‘ 


Fig. 2.19. The (x,#) diagram for the signaling problem. 
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where G(r)=c{ g(7)}. This gives the solution implicitly in terms of r(x, ἢ). 
The solution can be related to the solution of the initial value problem 
in two ways. The first method is to note that the two solutions agree if 


€=—71G(r), f(é)=g(r),  F(E)= G(r). (2.64) 


This corresponds to continuing the characteristic through t=7, x =0, back 
to the x axis and denoting the point of intersection as x = §; in this way the 
signaling problem is formulated as an initial value problem. The alterna-— 
tive method is to interchange the roles of x and ¢, and of q and p; in the 
formulas, do/dq=1/c will appear in place of dq/dp=c. 

Any multivalued overlap in the solution (2.63) has to be resolved by 
shocks. If G(0+)> co, there will be an overlap immediately since the first 
characteristic x =tG(O+) of the disturbed region is ahead of the last 
characteristic x =cot of the undisturbed region. In that case a shock of 
finite strength will start from the origin. The shock determination can be 
taken from the results for the initial value problem using either of the 
above methods, or it can be developed independently. If characteristics 
T(t) and 7,(t) meet the shock at the time ¢, then from (2.63), 


s(t) =(t-17,)¢,, c,=G(7,), 
s(t) =(t-1)¢,, C,= G(1)), 


and the formula corresponding to (2.60) may be written 


(2.65) 


ΤΊ 


Τὸ Ξ 72 
{ (42— P22) οι-- (91-011) 61) ae = =f q(t) dr. (2.66) 
2 l ΤΊ 

Equations 2.65 and 2.66 provide the three implicit equations for the 
functions 7,(¢), T,(¢), s(t). The most important case is that of a front shock 
formed at the origin [1.6., G(0+)>c ] and propagating into the undis- 
turbed region. Then we have p,;=py, C,=Cg 4|Ξ 4; and 7, can be 
eliminated from (2.65) and (2.66). At the same time we drop the subscript 2 
and reduce the shock relation (2.66) to 


{ (4-40) ~ (0 po)e} (t=) =~ [{4(π0- ao} ae’ (2.67) 


Here p, g, and ὁ are functions of τ determined from 


p=g(t), q=Q(g(r)), c=Q'(g(r)); 


they are all known functions of each other and when one is prescribed as a 
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function of τ the others follow. Equations 2.63 determine the solution in 
the disturbed region behind the shock; (2.67) determines the appropriate 
value τί at the shock and on substitution in (2.63) we have both the 
position of the shock and the value of ρ just behind it. 

In the initial motion of the shock, the value of τί) in (2.67) is small 
and we have 


{ (4-0) = (ρ,-- Po) ¢; } (t—7)=—(4;—)T+ O(77), 


where p,, 4. and c, are the initial values on x =0; that is, p,=g(0+), and so 
on. Therefore 


From (2.63), the shock position is 


x=(t—1)c,+ O(t’) 


= 2% 14 0(P). 
P;~ Po 

The shock starts with velocity (q;— 40) 7 ρ; — Po), and this result can be seen 
directly from the shock condition. If g(r) remains constant and equal to p,, 
this is exact for all ¢ and the solution is a shock of constant velocity 
separating the two uniform regions p= py, and p=g,. 

If g(r) returns to po, the shock ultimately decays. For a single positive 
phase with g(r) returning to py at r=T, the asymptotic behavior corre- 
sponds to T>T, t>0, ρ--Ὅρρ in (2.67). In this limit, (2.67) becomes 


] 7 2 τ , / 
56'(Po)(0~ po) t~ [{{4{(τ7) — 40} ar", 
. 0 
and the expression for the shock position in (2.63) becomes 


xX—~ Cot + c'(P9)(P — Po)t. 


Therefore at the shock we have 


ge denigiing te 
Po c' (po) t ’ 0 i ’ 


X~Cof + V2At, 


(2.68) 
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where 
T 
A= c'(po) f (q— 40) ar. 
0 


In the region behind the shock, 


c~ Col <x <eot + V2At, 


(c— ορ) 7 χ -- Cot eee? 


PPO = a7. ee oe 
π΄ σ(ρ) (Po) 


These results are very similar to those for the initial value problem. Other 
cases may be studied in the same way. If the positive phase is followed by 
a negative phase, there is a second shock whose asymptotic behavior is 
given by (2.68) with the modifications that A is replaced by the corre- 
sponding integral over the negative phase and the signs are changed 
appropriately. The ultimate form is an N wave with the formulas (2.69) 
extended back to the rear shock. 


2.12 More General Quasi-Linear Equations 


The general quasi-linear equation of first order is linear in p, and p, 
but may also have an undifferentiated term. The coefficients of p,, p, and 
the undifferentiated term may be any functions of p, x, t. If the coefficient 
of p, is nonzero, the equation may be divided by this quantity and written 
in the form 


p, + cp, = b, (2.70) 
where b and c are functions of p, x, and ft. Such equations can again be 


reduced to the integration of ordinary differential equations along 
characteristic curves by writing (2.70) in the characteristic form 


ἄρ᾽ dx 
τι  δίρ,χ,1), He CAP xt): (2.71) 


In particular the initial value problem with initial data 
p=f(x), t=0 


is solved by integrating the coupled ordinary differential equations in 
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(2.71) subject to the initial conditions 
p=f(&), x=§, at t=0. 


Each choice of € leads to the determination of the characteristic through 
x =£€ and the value of p along it. The solution in a whole region is obtained 
by varying the parameter ξ. 

When 50, p is not constant along the characteristics and generally 
the characteristics are not straight lines. But the method of determination 
is qualitatively the same. Again waves may break with the characteristics 
overlapping in the (x,f) plane. Again the multivalued solutions may be 
avoided by including suitable discontinuities. 

Some interesting cases concerning breaking arise and we will consider 
two examples here. 


Damped Waves. 
Consider as a first example the case 


c, + cc, + ac=0, (2.72) 


where a is a positive constant. In characteristic form it is written 


dc dx _ 


Ht ac, Hon (2.73) 


If we take the initial value problem, the first equation may be integrated to 


c=e “f(é). (2.74) 


Then the second equation is 


and we require x =£é at t=0. The solution is 
l—-— e # 
x=E+ ero) (2.75) 


The nonlinearity gives the typical distortion of the wave profile, but 
simultaneously the wave is damped due to the presence of the un- 
differentiated term in the equation. 
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Consider now the question of breaking. This is most easily investi- 
gated by seeing whether the characteristic curves (2.75) have an envelope. 
An envelope of these curves satisfies the derivative of (2.75) with respect to 
the parameter &: 


ene ἰ-ε νῷ. (2.76) 


Since a>0, ¢>0, this is possible if and only if 
f'(§&) <a. (2.77) 


Thus breaking occurs if and only if the initial curve has a large enough 
negative slope; the damping may prevent breaking if the compressive 
phase is not steep enough. 

Although the appropriate equations are more complicated than those 
just considered (see Chapter 3), this type of inequality determines whether 
the tidal variation propagating up a river will be strong enough to produce 
breaking into a bore, or whether the friction will dominate. For most rivers 
the frictional effects dominate. However, those famous rivers that have a 
bore have high enough tides at the mouth and additional reinforcement 
from rapid narrowing of the river to overcome the various frictional 
effects. This theory has been discussed and applied by Abbott (1956). It 
will be referred to again in Section 5.7. 


Waves Produced by a Moving Source. 


If b is independent of p in (2.70), it may be interpreted as an external 
source of the fluid. A particularly interesting case is when the source 
distribution moves with constant velocity V. There is a recent example, in 
the more complicated context of magnetogasdynamics, where a wave 
motion is produced by applying a moving force to the fluid (Hoffman, 
1967). We can examine some of the qualitative effects in our simple model. 

We take 


b= B(x- Vt), 


where V is constant and B(x) is a positive function tending rapidly to zero 
as |x|>0o. We assume that p has a constant value p=p, at ¢=0. If 
Cy= C(po), there are important differences depending on whether the source 
moves supersonically with V >cy or subsonically with V<cp. 

The surprising result is that a supersonic source need not produce a 
shock, whereas a subsonic source always does. This can be seen quite 
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simply by looking for a steady profile solution with 


p=p(X), X=x— Vt. (2.78) 


Since we are only looking at models anyway, with the aim of showing 
qualitative effects, let us take the special case 


c,+.cc, = Β(χ-- Vt). (2.79) 
Then in the steady profile solution 


(c—V)cy=B(X), 


MV =e) S476) = -ἰ Βο)φΦ. 


In the supersonic case, V Ὁ ορ» the solution for c is 


paige ((v-ay?-2f Bra} (2.80) 
If 


Ves [17 aoe). (2.81) 


(2.80) is a satisfactory single-valued solution for all X and no shock is 
required. The criterion (2.81) is an inequality between the speed V—c, and 
the total source strength 


[Bw 


We can get a feeling for the result by the following argument. If the 
source moves with a large supersonic speed, the only shock that could keep 
up with it would be strong. But if the source is relatively small, a strong 
shock cannot be produced and is not required. 

When the inequality (2.81) does not hold, (2.80) breaks down for 
X < Xo, where 


1/2 


V—eo= [2 B(») 4 


At X=X,, c=V and transients from the starting conditions can and do 
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overtake the wave. The solution cannot be completed without a detailed 
discussion of the transients. Similarly, in the subsonic case the solution 
cannot be established without full discussion of the transients. In both 
these cases shocks are found to occur. A detailed discussion is given by 
Hoffman (1967). 


2.13 Nonlinear First Order Equations 


The discussion of quasi-linear equations has raised many questions 
that require further consideration. Before proceeding, however, we note 
briefly that similar constructions using characteristics go through in the 
general case of fully nonlinear first order equations. These results will also 
be needed later. 

It will be useful to have the characteristic form for an equation 
in n independent variables (x,,...,x,). We consider, then, a function 
(x,,---,X,) which satisfies a differential equation 


H(p,¢,x) ΞΟ, (2.82) 


where p and x denote the vectors with components p, and x,, i=1,...,n, 
and 


(2.83) 


We may motivate the characteristic form by asking whether there are 
curves in x space with special properties akin to the characteristics of the 
quasi-linear equations. Any curve © in x space may be written in para- 
metric form 


x=x(A). 


The total derivative of φ along the curve Ὁ is* 


Is there a choice of the direction vector dx, ,/aX which has special signifi- 
cance for the solution of (2.82)? In the quasi-linear case where we have 


*We use the summation convention that a repeated subscript is automatically summed over 
Lyscaghh 
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H =c,(¢,x)p,; — δ(φ, Χ), we choose 

cd 

AL ο(φ, Χ) 
so that dp/dA=c;, p;; we then use the equation to obtain 


do 
dr = b(¢,x). 


But generally the p, cannot be eliminated in the expression for dp/dd 
whatever the choice of the dx,/dA. We do not have an ordinary differen- 
tial equation for φ alone; the p, are involved. However, consider in 
addition the total derivatives of the p, on ©. We have 


dp, 4|94Φ = 0*o ax; 
Ox; dx,0x, dr’ 


Ah aK (2.84) 


and the x, derivative of (2.82) yields 


0*> 0H . OH 9%. OH 
τον ap, cra aa ee (2.85) 


Comparing the two, we see the special advantage in choosing curves ὦ 
defined by , 


et = “e (2.86) 
For in that case (2.84) may be calculated from (2.85) as 
Bi βῆ. ἐπ (2.87) 
Then if we add 
dp _ oH (2.88) 


(2.86)-(2.88) are a complete set of (2n+1) ordinary differential equations 
for determining a “characteristic curve” x,(A) and the values of ¢ and p,; 
along it. In principle, the solution in a whole region can be obtained by 
integrating these characteristic equations along the characteristics covering 
the region. 
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In the special case of the quasi-linear equation, H =c;($,x)p; — δίφ, Χ), 
(2.86) and (2.88) reduce to 


aX; | 
ar = c;(¢,x), 


do 
Dh TPG = δί(φ,Χ), 


which may be solved independently of (2.87). In the earlier discussion one 
of the x, was the time ¢, the corresponding c, was unity and the parameter A 
was { itself. 


CHAPTER 3 


Specific Problems 


In this chapter the basic ideas developed so far are applied in more 
detail to the particular cases raised in Section 2.2. At the same time, the 
general ideas can be taken further on the basis of specific sets of equations. 


3.1 Traffic Flow 


The application of these ideas to traffic flow was formulated and 
discussed independently by Lighthill and Whitham (1955) and Richards 
(1956). It is clear in this case that the flow velocity 


V(o)= ee 


must be a decreasing function of p which starts from a finite maximum 
value at p=O0 and decreases to zero as p—p,, the value for which the cars 
are bumper to bumper. Thus Q(p) is zero at both p=0 and p= p,, and has a 
maximum value qg,, at some intermediate density p,,. It has the general 
convex form shown in Fig. 3.1. Actual observations of traffic flow indicate 
that typical values for a single lane are p;~225 vehicles per mile, p,,~80 
vehicles per mile, g,,~1500 vehicles per hour. It appears to be roughly 
correct to multiply these values by the number of lanes for multilane 
highways. It is interesting that, according to these figures, the maximum 
flow rate g,, is attained at a low velocity in the neighborhood of 20 miles 
per hour. 
The propagation velocity for the waves is 


c(p) = Q'(p)=V(p) + pV'(p). 


Since V’(p) <0, the propagation velocity is less than the car velocity; waves 
propagate backward through the stream of traffic and drivers are warned 
of disturbances ahead. The velocity c is the slope of the (g,p) curve so the 
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Fig. 3.1. Flow-density curve in traffic flow. 


waves move forward or backward relative to the road depending on 
whether p<p,, or p>p,,. At the maximum flow rate, p=p,,, the waves are 
stationary relative to the road, so the propagation velocity relative to the 
cars is then the same as g,,/p,,~20 mph. 

Near p=p,;, we can make a rough estimate on the basis of a simple 
reaction time argument. If we assume that a driver and his car take a time 
δ to react to any change ahead, then the gap between cars should be kept 
at V6 for safety. If h is the headway, defined as the distance between the 
front ends of successive cars, and L is the typical car length, this leads to 


Since h=1/p, L=1/p,, we have 


vie)=F (2-1), O0)=£(e-0). 


One should probably interpret this as an estimate of the slope of the Q(p) 
curve at p,, rather than as a realistic prediction of a linear dependence on 
p. In any event, it gives c,= —L/6 for the propagation velocity there. In 
the traffic flow context 6 is usually estimated in the range 0.5—1.5 sec, 
although in other circumstances the human reaction time can be much 
faster. With L =20 ft, 5=1 sec, we have c;~ — 14 mph. 

Greenberg (1959) found a good fit with data for the Lincoln Tunnel in 
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New York by taking 


θ᾽; 
Ο(ρ) Ξαρίορ --, 


with a=17.2 mph, p,=228 vpm (vehicles per mile). For this formula, the 
relative propagation velocity V—c is equal to the constant value a at all 
densities. The values of p,, and q,, are p,, ΞΞ 83 vpm, 4,, = 1430 vph (vehicles 
per hour). The logarithmic formula does not give a finite value for V as 
p—0, but the theory would be on dubious ground for very light traffic so 
this point alone is not important. With a finite maximum V and a finite 
V'(p), we have c—>V as p—0, so one should expect V—c to decrease at the 
lighter densities. 

Since Q(p) is convex with Q”(p)<0, c itself is always a decreasing 
function of p. This means that a local increase of density propagates as 
shown in Fig. 3.2 with a shock forming at the back. Individual cars move 
faster than the waves, so that a driver enters such a local density increase 
from behind; he must decelerate rapidly through the shock but speeds up 
only slowly as he leaves the congestion. This seems to accord with 
experience. The details can be analyzed by the theory of Chapter 2. In 
particular the final asymptotic behavior is the triangular wave which is the 
last profile in Fig. 3.2. The length of the wave increases like t'/* and the 
shock decays like 1 12 The actual analytic expressions are 


C™ D— Po 757. \ 2 for Col — ν 2Bt <X<Col, 
c’(Po)t 


where 


B= le(eodl [ Ο-- Po) ax. 


Fig. 3.2. Breaking wave in traffic flow. 
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The shock is at 
x= Col = ¥ 2Bt 


and the jumps of c and p at the shock are 
ν t : le’(P9)| t 


Traffic Light Problem. 


> 


A more complicated problem is the analysis of the flow at a traffic 
light. We construct the characteristics in the (x,t) diagram. These are lines 
of constant density and their slopes c(p) determine the corresponding 
values of p on them. So the problem is solved once the (x,/) diagram has 
been obtained. 

Suppose first that the red period of the light is long enough to allow 
the incoming traffic to flow freely at some value p; <p,,. Then we may start 
with characteristics of slope c(p,) intersecting the ¢ axis in the interval AB 
in Fig. 3.3; AB is part of a green period. [The (x, ¢) diagram 1s plotted with 
x vertical and ¢ horizontal since this is the usual practice in the references 
on traffic flow.] Just below the red period BC, the cars are stationary with 
o=p,; hence the characteristics have the negative slope c(p,). The line of 
separation between the stopped queue at the traffic light and the free flow 
must be a shock BP, and from the shock condition its velocity is 


_ q(p;) 
DP; ~ B; 
distance 
j S . R 
no cars no cars ' J 
here ῳ here “ ͵ 
lights ως ς ἘΠῚ Chir 0:2 220 
ἧ HES A --- —-\ SQ 
p 
/ 
/ 
/ ra 
/ 
/ 
“ys ver. Oo τοῖς ἃς ὅς ας wns. : . 
wae red -——+ green ----- Ged = Green ------- mine 


Fig. 3.3. Wave diagram for an efficient traffic light. 
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When the light turns green at C, the leading cars can go at the maximum 
speed since p=0 ahead of them. (The finite acceleration could be allowed 
for roughly by extending the effective red period.) This is represented by 
the characteristic CS with maximum slope c(0). Between CS and CP we 
have an expansion fan with all values of c being taken. Exactly at the 
intersection CQ, the slope c must be zero. But this corresponds to the 
maximum q=q,,. Therefore we have the interesting result that q attains its 
maximum value right at the traffic light. The shock BPQR is weakened by 
the expansion fan and ultimately accelerates through the intersection, 
provided the green period is long enough. The criterion for whether the 
shock gets through is easily established. The total incoming flow for the 
time ΒΟ is (t,+ ¢,)g; where ¢, is the red period BC and 1, is the part of the 
green period before the shock gets through. The flow across the intersec- 
tion in this time is 1 4,,. These two must be equal; therefore 


19; 
Gm ~~ q; 
For the shock to get through and the light to operate freely, the green 
period must exceed this critical value. 
If the shock does not get through, the flow never becomes free and the 


notorious traffic crawl develops. It is perhaps sufficient to show the 
corresponding (x, 7) diagram Fig. 3.4 without comment! 


5 


Higher Order Effects; Diffusion and Response Time. 


There are two obvious additional effects one may wish to include in 
the theory. One was mentioned in Section 2.4: the dependence of 4 on p, 
as well as p. This introduces in a rough way the drivers’ awareness of 
conditions ahead, and it produces a diffusion of the waves. The simplest 
assumption with the correct qualitative behavior is 


g=Q(p)— »p,, v= Κρ) -- 7 bx, (3.1) 


and one does not have much basis for any more complicated choice. 

The second effect is the time lag in the response of the driver and of 
his car to any changes in the flow conditions. One way to introduce this 
effect is to consider the expression for v in (3.1) as a desired velocity which 
the driver accelerates toward; therefore the equation 


[υ- V(p)+ *p, | (3.2) 


OF 00, = mes 
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Fig. 3.4. Wave diagram for the slow crawl at an overcrowded traffic light. 


may be introduced for the acceleration. The coefficient τ is a measure of 
the response time and is akin to the quantity ὃ mentioned earlier, Equation 
3.2 is to be solved together with the conservation equation 


p, + (pv), =0. (3.3) 


When », τ are both small in a suitable nondimensional measure, (3.2) is 
approximated by v=V/(p), and we have the simpler theory. With the 
higher order terms included in (3.2), we expect shocks to appear as smooth 
steps and so on. This is true on the whole, but the situation turns out to be 
more complicated. 

It is always helpful to get a first feel for a nonlinear equation by 
looking at the linearized theory, even though the linearization may have its 
own shortcomings, as we discussed in Section 2.10. If (3.2) and (3.3) are 
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linearized for small perturbations about p= po, v = vg = V (9), by substitut- 
ing 


P=Potl, D=Ugt w, 


and retaining only first powers of r and w, we have 
T(w,+ v9W,) = - | w—V'(p))r+—r, 
Po 


r,+ Uor, + Pow, = 9. 
The kinematic wave speed is 
Co= Po" (ρο) + V0); 


hence V’(p))= -- ὑρ-- Co)/Po- Introducing this expression and then 
eliminating w, we have 


(3.4) 


ar, or _ 8 (2 3, 
ὃ dx dx? 


When v=7=0, we have the linearized approximation to the kinematic 
waves: r=f(x—C gt). The term proportional to » introduces typical diffu- 
sion of the heat equation type. The effect of the finite response time 7 is 
more complicated, but a quick insight can be gained as follows. In the 
basic wave motion governed by the left hand side, r= f(x — cot), so that ¢ 
derivatives are approximately equal to — cy multiplied by x derivatives: 


0 0 


—- =~ —c¢—., 
Ot ° ax 


(3.5) 


If this approximation is used in the right hand side of (3.4), the equation 
reduces to 


or or 2) 9} 
a, t Cone = 1? (ὑπ 0) ra (3.6) 
There is a combined diffusion when 


but instability if 


p< (v9 —€9) 7- (3.8) 
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This is reasonable; for stability a driver should look far enough ahead to 
make up for his response time. 

The stability criterion can be verified directly from the complete 
equation (3.4) in the traditional way. There are exponential solutions of 
(3.4) with 


ro e tkx τ ίωι 


provided that 
τίω-- ok) + i(w— Cok) — vk? =0. 


The exponential solutions will be stable provided $w<0 for both of the 
roots w. It is easily verified that the requirement for this is (3.7), so the 
result of the approximate procedure is confirmed and extended to all 
wavelengths. 


Higher Order Waves. 


It is important to note that the right hand side of (3.4) is itself a wave 
operator and we may write the equation as 


tegen a tea a te-g (3.9) 
where 


C,=Upt Vv/t , C_=Up— ννήτ. 


It would be expected therefore that waves traveling with speeds c, and c_ 
also play some role. It would be premature to go deeply into this question 
at this stage, but one remark has great significance in interpreting the 
Stability condition. We shall see later in our discussion of higher order 
equations that the propagation speeds in the highest order derivatives 
always determine the fastest and slowest signals. Thus in the present case 
however small + may be provided it is nonzero, the fastest signal travels 
with speed c, and the slowest with speed c_. It is clear therefore that the 
approximation 

© + ep =0 (3.11) 
could only make sense if 


CSC 0s. | (3.12) 
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But this is exactly the stability criterion (3.7). So the flow is stable only if 
(3.12) holds, and then it is appropriate to approximate (3.9) by (3.11) for 
small +. There is a nice correspondence between stability and wave 
interaction. 

Equation 3.9 arises in several applications and a full discussion is 
given in Chapter 10. 


Shock Structure. 


The more complicated form of the higher order corrections introduces 
a new possibility in the shock structure. For the simple diffusion term used 
in Section 2.4 with ν᾿ Ὁ, a continuous shock structure was obtained. We 
shall see now that this is not always the case when there are additional 
higher terms. We look for a steady profile solution of (3.2)-(3.3) with 


p=p(X), v=v(X), X=x— Ut, 
where U is the constant translational velocity. Equation 3.3 becomes 


— Upy + (vp), =0 (3.13) 
and may be integrated to 


\ 
\ 


p(U—v)=A, : (3.14) 
where A is a constant. Equation 3.2 becomes | 
τρίυ-- U)vy + vpy t+ pv— O(p) =0. (3.15) 


Since v= U—A/p, this may be reduced to 
A* 
χυ- Py = O(p)-pU+A. (3.16) 


For 7=0, " 15 τε same as (2.21), as it should be. For τϑξθ, the possibility 
that ν -- A*r/p* may vanish introduces the new effects. 

As before we are interested in solution curves between p, at X= +00 
and p, at X=— oo. These values will be zeros of the right hand side of 
(3.16). For traffic flow c’(p)= Ο΄ (ρ) “0, so ρ) =e and the right hand side 
of (3.16) is positive for p,<p<p). If ν -- A*r/p” remains positive in this 
range, then py >0 and we have a omooe profile as in Fig. 3.5. In view of 
(3.14), the condition for ν — 43 τ7ρ2 to remain positive may be written 


y>(v-U)'r,  thatis, υ- ννγτ <U<v+Vv/7. (3.17) 
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| P=P| 
P P=P, ‘ 
P=P2 
X Χ 
Fig. 3.5. Continuous shock structure. Fig. 3.6. Shock structure with an inner dis- 
continuity. 


This is similar in form to the linearized stability criterion (3.7) with v, 
replaced by the local velocity v and cy replaced by the shock velocity U. 
We might also interpret it in a way similar to (3.12) as a warning of 
possible complications if a shock tries to violate the higher order signal 
speeds. However, it is not necessarily an unstable situation. The conditions 
for the uniform states at +00 to be stable are 


v,— Vv/t <¢e,<0,+ Vov/r , v»— Ve/T <ce,<0,+ Vo/t . (3.18) 


It is possible, in general, for these to be satisfied and yet (3.17) to be 
violated. When this is the case, »— Ar /p? changes sign in the profile, as in 
Fig. 3.6, and a single-valued continuous profile is no longer possible. 

In most problems of shock structure, when the profile turns back on 
itself in this way, it is rectified by fitting in an appropriate discontinuity. 
The situation again corresponds, strictly speaking, to a breakdown of the 
assumptions for the particular level of description, but the introduction of 
a discontinuity, provided it corresponds to a valid integrated form of the basic 
equations, avoids an explicit discussion of yet higher order effects. In the 
case of (3.2) and (3.3), it is not clear which conservation forms are 
appropriate for the discontinuity conditions nor what additional effects 
should be introduced. One expects a discontinuous profile shown by the 
full curve in Fig. 3.6, but the precise determination of the discontinuity is 
not clear for this case. In other cases discussed later the details can be 
completed. The point to stress here is that the discontinuities in the simple 
theory using 


p, + c(p)p, =0 


may be only partially resolved into continuous transitions in a more 
accurate formulation. 
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A Note on Car-following Theories. 


Considerable work has been done on discrete models where the 
motion of the nth car in a line of cars is prescribed in terms of the motion 
of the other cars. [See, for example, Newell (1961) and the earlier re- 
ferences given there.] If the position of the nth car is s,(¢) at time 1, the 
assumed laws of motion usually take the form 


5,(t+)=6 (5. «(ἡ —5,(4)}. (3.19) 


between velocity §, and headway h, =s,_,— 5, with a time lag A to account 
for the driver response time. If G(h,) is chosen to be linear in 4,, or if the 
equation is linearized to study fluctuations about a uniform state, solutions 
can be obtained by Laplace transforms. In general, however, one must 
appeal to computer studies. 

This type of model takes a more rigid view of how each individual car 
moves, so it is narrower in scope than the continuum theory, where the 
whole complicated behavior of the individuals is lumped together in the 
function Q(p) and the parameters » and τ. But each model leads to a 
particular form for these quantities, which may be helpful in interpreting 
observational data. Moreover, such models may lead to additional effects 
that cannot be seen in the continuum theory. 

To see the correspondence of the particular car-following model in 
(3.19) with the continuum theory, we first note the relation of G(h) to 
Q(p). In a uniform stream with equal spacing h, the velocities in (3.19) are 
all equal and are given by the relation v= G(h). Since h=1/p, v=q/p, the 
function Q(p) in the corresponding continuum equations is 


0(0)=a( +) 


If empirical or other information is known about G(h), it may be trans- 
ferred to information about Q(p) near p=p,. Of course at lower densities 
Q(p) will be affected more by cars overtaking and changing lanes. 

The wave propagation described by (3.19), in which the motion of a 
lead car is transmitted successively back through the stream, should be a 
typical finite difference version of the earlier continuum results with this 
choice of Q(p). The finite difference form of (3.19) also introduces higher 
order effects equivalent to those in (3.2) and we can make a detailed 
comparison. If we let 


v,(t)=5,(t), 5,1 (4) 5, (1) = A, (1), (3.20) 
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(3.19) is equivalent to the pair of equations 


v, (t+A)=G(h,), (3.21) 
dh, 
ἢ ξευ,. (1) -- υ,(1). (3.22) 


In this form we introduce continuous functions v(x, t) and μᾷ(χ,1) such that 


 v(5,,t) =v, (1), (3.23) 


h Sait Sp 
[ΞΡ = 4,00), (3.24) 


and obtain corresponding partial differential equations in the approxima- 
tions of small A and small h,. Equation 3.21 may be written 


o{s,(t+A),t+A}=G ΠΝ rht) 


and it may be approximated by 
v+ (0, + v0,)A= G(h) + LAG'(h)h, (3.25) 


where the functions are evaluated at χ ξξ 5, ([) and the errors are of order 
A’, h*. Equation 3.22 may be written 


d Spi TS, 
Sn tt) = v(s,_),¢)—v(s,,¢) 


and approximated by 


5,..-1 +5, 
h, + vh,, = ho, at x= ἥν πεν. (3.26) 
The error in (3.26) is third order in h [due to the centering of h at the 
midpoint (s,_,+5,)/2], so the equation is correct to both first and second 
orders. In terms of p=1/h, V(p)= G(h), (3.25)3.26) become 


1 V'(p) 


v+(v,+ vv, )A= V(p)+ 5 κ᾿ Pxs (3.27) 


p, + (pv), =0. (3.28) 
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To lowest order in A and h, we would have 
v=V(p), p+ (pv), =9, 


which is just the kinematic theory. The differencing has been arranged so 
that the next order corrections leave the conservation equation (3.28) 
unchanged. 

Equations 3.27 and 3.28 are identical with (3.2) and (3.3) if we take 


τε Δ, γε ταν). 


Since V—c=-—pV'(p), the stability criterion (3.7) may be written 
2p7|V'(p)|A<1, 


Or, equivalently, 
2G'(h)A<1. 


This is exactly the condition found in the car-following theories (Chandler, 
Herman, and Montroll, 1958; Kometani and Sasaki, 1958). Similarly the 
shock structures discussed earlier on the basis of (3.2) should be close to 
those discussed by Newell (1961) on the basis of (3.19). 

An effect that cannot be covered by the continuum theory is the 
actual collision of cars. In a queue described by (3.19), this occurs if 
S,—1—S, ever drops to the car length L. In the special case 


§,(t+A)=a{s,_)(t)—s,(4) —L}, 


which can be solved by Laplace transforms, it may be shown that the 
criterion for avoiding collision is 


n—-1 


phe: 
e 


this is slightly more stringent than the stability criterion 2a4<1 found 
above. The analysis would take us too far afield and the reader is referred 
to the discussion of local stability in the paper by Herman, Montroll, Potts, 
and Rothery (1959). 


3.2 Flood Waves 


For flood waves, the “density” in the sense of the general theory 
presented in Chapter 2 is the cross-sectional area of the riverbed, A(x, ¢), at 
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position x along the river at time ¢. If the volume flow across the section is 
q(x,t) per unit time, the conservation equation is 


d x 
 [- At det a(xnt) - 4αγη πο, 


or, in differentiated form, 


94. 99 

Ὅγ + ax =0. (3.29) 
Flow in a river is obviously so complicated that any flow model for the 
second relation between g and A must be extremely approximate, giving 
only qualitative effects and general order of magnitude results for propaga- 
tion speeds, wave profiles, and so on. However, observations during slow 
changes in the river level may be used also to establish the dependence of 
depth and the area A on the flow q. These provide empirical curves for the 


function 
q=Q(A,x) (3.30) 


in steady flows. This relation can be combined with (3.29) to give a first 
approximation for unsteady flows which vary slowly. Then A(x, ¢) satisfies 


94 9Q0A__ 90 
fo Ok oe Τ᾿ (3.31) 


We have again the theory discussed in Chapter 2 with the propagation 
velocity 


_ 92 _192 
CA on on (3.32) 


[The second form introduces the breadth b and depth h, and dA = bdh.| 
This is the Kleitz-Seddon formula for flood waves, apparently established 
first by Kleitz (1858, unpublished) and thoroughly discussed and used 
effectively by Seddon (1900). 

Empirical relations for (3.30) can be viewed against simple theoretical 
models. The relation is an expression of the balance between the frictional 
force of the river bed and the gravitational force. In theoretical models, the 
frictional force is usually assumed to be proportional to v”, where v is the 
average velocity 


4? 


and also proportional to the wetted perimeter P of the cross-section at 
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position x. This force is then expressed as PoC,Pv? per unit length of river, 
where py is the density of water and C; is a friction coefficient. The 
gravitational force is p)gA sina per unit length, where a is the angle of 
inclination of the surface of the river. Hence 


A gsina 
v= \/— : 
VP G (3.33) 


__, 4 [4 gsina 
QO=vA= PC 


The wetted perimeter P is a function of A, and C, may also be allowed to 
depend on A. For broad rivers P varies little with the depth of the river 
and may be taken to be constant. If C, and a@ are also taken to be 
constants, (3.33) gives the Chezy law 


ἢ ὰ 1372 
ὑσ.. OnAr, 


Then the propagation velocity 


More generally, P and C; are functions of A, and power law dependences 
for these give υα A”, Qa A'*" with other values for n. For example, a 
triangular cross-section gives P « A'/? and leads to n=4; Manning’s law 
Cx AW '/3 leads to n= 3. For all these power laws the propagation velocity 
15 


c=(l+n)v. 


As expected, flood waves move faster than the fluid but the propagation 
velocity may not be very much greater than the fluid velocity. 

Seddon turns the calculation around and uses his observations of the 
propagation velocity to deduce the effective shape of the bed, that is, the 
dependence of P on A. This is a valuable idea in all kinematic wave 
problems: use observations of the propagation velocity c to infer the q-p 
relation. 

If the dependence of Q on x is omitted, (3.31) reduces to 


A,+c(A)A, =0, 


and the general solution may be taken from Chapter 2 with shocks fitted in 
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as discontinuities satisfying 


4, 41 

U= ———__ 
A,—A, 

For the power laws suggested (and this is also borne out by observations), 

c(A) is an increasing function of A; hence waves due to an increase in 

height break forward, and shocks carry an increase in height, A, >A). 


H igher Order Effects. 


As in the other examples discussed, a more accurate treatment of the 
relation between g and A than that expressed in (3.30) involves higher 
derivatives. In unsteady flow the frictional and gravitational forces do not 
balance exactly and their difference is proportional to the acceleration of 
the fluid; the difference between the slope of the water surface and the 
slope of the bottom also makes a contribution. 

It will be valuable to express the equations in conservation form so 
that, when necessary, appropriate discontinuity conditions can also be 
deduced. For simplicity, we consider the case of a broad rectangular 
channel of constant inclination a and work with the depth ἢ and mean 
velocity v as basic variables in place of A and g. The conservation of fluid 
for unit breadth can then be written 


ὦ (δι xy 
< [ hdx + [ho]. =0, (3.34) 


and we need to add a more detailed formulation of the conservation of 
momentum. The appropriate equation in hydraulic theory is 


d x1 27%1 1 2 ᾿ 
77 [ hv dx + | hv +| 5 8} cosa 


x 
X2 


=f sehen f "'Cwrde. (3.35) 
x2 


x2 


Apart from the common factors p, (the constant density of water) and the 
breadth δ᾽ which have been cancelled through, the five terms in this 
equation are, respectively, (1) the rate of increase of momentum in the 
section x,<x <x,, (2) the net transport of momentum across x, and x,, (3) 
the net total pressure force acting across x, and x,, (4) the component of 
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the gravitational force down the incline, and (5) the frictional effects of the 
bottom. The pressure term requires some comment perhaps. In hydraulic 
theory the dependence of the velocity on the coordinate y normal to the 
bed is averaged out to v(x,/) and the fluid acceleration in the y direction is 
neglected. The latter assumption means that the pressure satisfies a 
hydrostatic law 

op 

ὃν = — Po 8 COS a. 
Hence 


P-—Po=(h—-y)pog cosa 


and the total contribution of the perturbed pressure integrated over a cross 
section of the river is 


‘ l 
bf (»-- ρο) ὦ Ξ 5 h*pogb cosa; 


this is the origin of the third term in (3.35). 

Equations 3.34 and 3.35 are the two conservation equations for h and 
Ὁ. If h and v are assumed to be continuously differentiable, we may take 
the limit x, — x,->0 to obtain partial differential equations for h and v. It 
will be a minor saving in writing to introduce g’=gcosa and the slope 
S=tana. The equations for h and wv are then 


h, + (hv), =0, 


(3.36) 


(hv) ,+ (to? + 76} = 2hS -- Cyv’. 
We may also use the first equation to simplify the second and take the 
equivalent pair 


h, + οὗ, + hv, =0, 
(3.37) 


a2 
v, +00, + 2h, =g2'S— OF 
The kinematic wave approximation to (3.37) neglects the left hand 
side of the second equation and takes 


1/2 


"5 
h, + (hv), =0, o-(&] pi/2. (3.38) 
Cy 


Sec 3.2 FLOOD WAVES 85 


In this kinematic theory, discontinuous shocks must satisfy the shock 
condition 


 Ὀχῆχ-- vyhy 
U= ie ἢ (3.39) 


Stability; Roll Waves. 


We now consider the consequences of the additional terms in (3.37). 

For simplicity, S and C; are assumed to be constant. As in the traffic flow 

problem, we look first at the linearized form of the equations for small 
perturbations about a constant state v=», h=ho, where 

09 


If we substitute 
vD=Uytw, h=h)+n 
and neglect all but the first powers of w and ἡ, we have 


ἡ, Lon, + Now, = 9, 


wb owt eng te’S( 2 a 7 )=0. | 


We may then eliminate w and write the single equation for ἡ in the form 
0 0 0 0 23'S (. ὃ θὴ, 
[ ὅς ὃ Ἱ στο δ }n+ τα [ 2 + com jn =0, (3.41) 
where 


v 
Cy Ξυρε Vae%hy , C_=t)— Va'hy , Co= = (3.42) 


The equation is now the same as in the earlier discussion of (3.4) and (3.9), 
with appropriate changes in the expressions for c,, c_, and cy. Accord- 
ingly, the stability condition is 


Cree 67, (3.43) 
and this also ensures that the lower order approximation 


θη θη 
ae + “Onyx = 


> 0 (3.44) 
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Fig. 3.7. Roll waves. 


does not violate the characteristic condition. Equation 3.44 is the linearized 
version of (3.38), of course. 
The stability conditions may also be written, using (3.42), as 


0) <2V g'hy , 
or, again, from (3.40) as 
S<4C,. 


For rivers, v, is usually much less than V g’hg , but spillways from dams 
and other man-made conduits easily exceed the critical values. The result- 
ing flow is not necessarily completely chaotic and without structure. In 
favorable circumstances, it takes the form of “roll waves,” as shown in Fig. 
3.7, with a periodic structure of discontinuous bores separated by smooth 
profiles. Early observational data and photographs of the phenomenon 
were obtained by Cornish in 1905 and are beautifully described in his — 
classic book (Cornish, 1934), which summarizes his observations of waves 
in sand and water. The most specific data refer to a stone conduit in the 
Alps (the Griinnbach, Merligen) with a slope of 1 in 14. On an occasion 
when the mean depth was approximately 3 in., the mean flow velocity was 
estimated as 10 ft/sec and the whole roll wave pattern moved downstream 
at an average speed of 13.5 ft/sec. For these figures the Froude number 


0>/ Va'ho is 5.6, considerably in excess of the critical value of 2. These 
values would give S/C,= 12.5 and lead to C;=0.006. 

Jeffreys (1925) proposed the instability argument and noted that for 
smooth cement channels (for which he performed experiments) the friction 
coefficient is C,~ 0.0025; this value of C, agrees with current values. For 
the latter value, uniform flows should become unstable when the slope S 
exceeds 1 in 100. Jeffreys found his own experiments on the production of 
roll waves inconclusive, but he felt that long channels with slopes con- 
siderably in excess of 1 in 100 were needed. Much later Dressler (1949) 
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took up the subject and showed how to construct nonlinear solutions of 
(3.36), with appropriate jump conditions, to describe the roll wave pattern. 
The details will be indicated after the question of steady profile waves for 
the stable case has been considered. Ὁ 


Monoclinal Flood Wave. 


The structure of the shocks arising in the kinematic theory (3.38 and 
3.39) is particularly important in the flood wave problem, since in reality 
the shock thickness is of the order of 50 miles! It is obtained as usual by 
searching for steady profile solutions in a more detailed description which, 
in this case, is provided by (3.37). We look for solutions with 


h=h(Xx), v=v(X), X=x— Ut. 


The equations may be written 


av adh vp 
-- --Ο--.ς. .. Pe aa / — — 4 
(υ-- ΟἹ) ry +8 ay 2S OF (3.45) 
h(U—v)=B, (3.46) 


where the continuity equation has been integrated to (3.46) and B is the 
constant of integration. The uniform states (h,,v,) at X =0o satisfy 


ee eee τὴ 
5 _ ie Sho” 


h,\(U—0,)=h,(U— 0) =B. 


If we express all flow quantities in terms of h, and h,, we have 


S 7 S , 
v= c. h, οἷ τε CP hp, (3.47) 
ς 1/2 "ἢ 
Ό,) Ὁ᾽ g 14 
B= hice a 2. 3.48 
i) a | a | ni? + 8}. wee 


U 


1/2 


The last of these is exactly the shock condition governing discontinuities in 
the kinematic theory (3.39). This is the usual pattern and we expect the 
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solutions of (3.45) and (3.46) to provide the structure of these kinematic 
shocks. 
When v is eliminated from (3.45) and (3.46), the equation for h(X) 
takes the form 
dh (B- Uh) C,-g'h?S 0 
dX Se gt hB- Bz (3.5 ) 
Since the numerator must vanish for h=h, and h=h,, these two values 
must be roots of the cubic. Then the third root is 


Seek 
δ᾽ g’hyh, 
Ayhy 


— Ξε τα eG : 
ΠΣ 


Since H<h,,h,, and the solution has ἢ between h, and h,, this third root 
h= H is never a value taken in the solution considered. 
Equation 3.50 may now be written 


dh _ (h,—h)(h—h,)(h—- ΗῚ 
x ae (3.51) 


and the behavior of the solution depends critically on the sign of the 
denominator h?— B?/g’ and its possible change of sign in the profile. 
From (3.46), 


g/h?— B2=9/h3—(U-v) hr? 
=h{ g’h—(U-v)’}; 


hence the sign is positive or negative depending on whether 


U Svot+ Vath. 


[From (3.48), B >0; therefore from (3.46), U>v and U is always greater 
than o— Vg’h |] 


When h,-—>h,, we see from (3.49) that 
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h=ho 


h=hy 


ia ae ne ΠΝ 


Fig. 3.8. Structure of the monoclinal flood wave. 


In the stable case, $v,<v,+ Vg’h, ; thus for weak waves we start the 
integral curve of (3.51) from h=h,, X = οο, with the denominator in (3.51) 
negative. Accordingly, hy >0, h increases, g’h?— B* remains positive and 
we have a smooth profile as shown in Fig. 3.8. This is the so-called 
monoclinal flood wave. The fact that h,>h, is required for this profile 
agrees with the tendency of breaking of the kinematic waves, since this is a 
problem with c’(h)>0. A smooth profile of this type will continue to hold 
for the range of shock velocities 


30, 


: <U<v,+Ve'h, . (3.52) 


From (3.47) and (3.49) it is easily shown that this is the range 


1/2 
,,\'/2 1+ {14+4(5/G)'"} 
] « —* < 1/2 

hy 2(57 0) 

But (3.52) is the more significant form, in view of the physical interpreta- 
tion of the velocities. The shock moves faster than the lower order waves 


but slower than the higher order waves in the flow ahead. 
When 


0,+ V gh, <U<v,+ gh, : 


the denominator in (3.51) changes sign in the profile and the integral curve 


Fig. 3.9. Structure of the monoclinal flood wave with an inner discontinuity. 
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turns back on itself as in Fig. 3.9. A single-valued profile is recovered by 
fitting in a discontinuity as indicated. In contrast with the case of traffic 
flow, basic conservation equations in integrated form, (3.34) and (3.35), are 
known, and these apply whether the solution has discontinuities or not. If 
the same procedure developed in Section 2.3 is used on these,* the 
appropriate jump conditions at a discontinuity located at x =s(f) are 


—§{h]+[hv]=0, (3.53) 


το [he] +| ho?+ Σ κι τὸ (3.54) 


It should be especially noted that the right hand side of (3.35) makes no 
contribution in the final limit x,—>x,. These discontinuity conditions go 
along with the equations (3.36), just as (3.39) goes along with (3.38). One 
must be careful to pair correctly the equations and shock conditions at 
each level of description. In a change of the level of description, both the 
equations and shock conditions change in number. The discontinuities 
described by (3.53) and (3.54) are in reality the turbulent bores familiar in 
water wave theory as “hydraulic jumps” or breakers on a beach. 

In the present context, the proposal is to fit a discontinuity satisfying 
(3.53) and (3.54) into the steady profile solution of (3.36); hence it will also 
have the velocity U. In view of (3.46) any discontinuity between branches 
of the profile [including the lines h=h, and h=h, as possible solutions of 
(3.51)] will automatically have h(U—v) continuous to satisfy (3.53). The 
second requirement (3.54) determines where it should be placed. The 
condition (3.54) requires 


μυ(υ-- U)+ Sah 
to be continuous. From (3.46), this can be modified into 


BoA. τὸ 
n +8" 

Should be continuous. If the discontinuity is chosen to take the profile 
from h=h, to a point h=h* on the upper branch as shown in Fig. 3.9, the 


*Here to complete the physical problem we are requiring results from the later mathematical 
development in Chapters 5 and 10, but it seems better to include them here, with a minimum 
of explanation, rather than delaying completion of the solution. 
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requirement is 


B* 1, +2_ Β΄ lye 
je τὰν 
that is, 1/2 
h* {1+8B7/g’h}} —] 
ry 


This can be expressed in terms of h, and A, using (3.48). It can be verified 
that it meets the requirements (1) that g’h**— B?>0 so that h=h* is on 
the upper branch, and (2) that A* « ἢ; provided S<4C,. 


The overall conclusion, then, is that the original discontinuity of the 
kinematic theory (3.38 and 3.39) is resolved into a smooth profile when 
viewed from the more detailed description (3.36), provided (3.52) is satis- 
fied. For stronger shocks that violate (3.52), some discontinuity remains 
and corresponds to a shock discontinuity in the solution of (3.36). Further 
interpretations of the significance of (3.52) will be given (see Chapter 10) 
after the theory of characteristics and shocks for higher order systems has 
been developed in detail. 

The roll wave patterns referred to earlier are obtained by piecing 
together smooth sections satisfying (3.50) with discontinuous bores satisfy- 
ing (3.53) and (3.54). It may be shown that 9’h*?— ΒΖ must change sign in 
the profile but the smooth parts are kept monotonic by demanding that the 
numerator of (3.50) also vanish at the critical depth. This requirement 
relates the two parameters B and U; one or the other may be kept as a 
basic parameter in the family of solutions and is determined by the total 
volume flow. For further details, reference should be made to Dressler’s 
(1949) paper. 


3.3 Glaciers 


Nye (1960, 1963) has pointed out that these ideas on flood waves 
apply equally to the study of waves on glaciers and has developed the 
particular aspects that are most important there. He refers to Finsterwalder 
(1907) for the first studies of wave motion on glaciers and to independent 
formulations by Weertman (1958). 

In view of the difficulties of collecting data on the flow curves for 
glaciers, due to both the inaccessibility and the slowness of the flow, more 
reliance is placed on semitheoretical derivations. These consider in more 
detail the shearing motion in two dimensional steady flow down a constant 
Slope. Let u(y) be the velocity of the layer at a distance y from the ground 
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and let τί») be the shearing stress. For ice it seems to be appropriate to 
take the stress-strain relation as 


du 

<~=7", 3.55 
ae (3.55) 
where n=3 or 4. (Newtonian viscosity would be the case n= 1.) In 
addition, ice slips in its bed according to the approximate law 


νμ(0) =7™(0), (3.56) 


where m~}(n+ 1)~2. On the layer between y and y + ὃν, the difference in 
shearing stress must balance the gravitational force. If α is the angle of the 
slope and p is the density of ice, we have 


67 = —pdygsina. 
That is, 


a 
dy 


Since 7 vanishes at the surface y =h, the solution for τ is 


= —pgsina. (3.57) 


T= (h-y)pgsina. 
Then, integrating (3.55) with boundary condition (3.56), we have 


(pgsina) h” .1 (ρᾳ 5ἰπα})}" 


Ρ μ ntl {A"*!—(h—-y)"™}. (3.58) 


u(y)= 


The flow per unit breadth is 
h 
F(h)y=] u 
Or(h)= "ὦ 


(pgsina) h™*! " (pg sina) h”*? 


p n+2 (2?) 


For order of magnitude purposes, one may take 
O*(h) xh", 
with N roughly in the range 3 to 5. The propagation speed is 


_ 4Q* _ 
ς ah No 


where Ὁ is the average velocity @*/h. Thus the waves move about three to 
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five times faster than the average flow velocity. Typical velocities are of the 
order of 10 to 100 meters per year. 

Various problems can be solved using the results and ideas of Chapter 
2. An interesting question considered by Nye is the effect of periodic 
accumulation and evaporation of the ice; depending on the period, this 
may refer either to seasonal or climatic changes. To do this a prescribed 
source term f(x,f) is added to the continuity equation; that is, one takes 


h,+qt=f(x,t),  q*=Q*(h,x). (3.60) 


The consequences are determined from integration of the characteristic 
equations 


T= S(x,1)— OF(A,x), 


dx τ. 
i ΟΥ̓ (hx). 


The main result is that parts of the glacier may be very sensitive, and 
relatively rapid local changes can be triggered by the source term. 


3.4 Chemical Exchange Processes; Chromatography; Sedimentation 
in Rivers 


The formulation of equations for exchange processes between a solid 
bed and a fluid flowing through it was given in Section 2.2. The exchange 
may involve particles or ions of some substance, or it may be heat 
exchange between the solid bed and the fluid. Another instance is sediment 
transport in rivers (see Kynch, 1952). 

The equations coupling the density p, in the fluid and the density ρ, 
on the solid are 


0 
2 (p,+p,) + = (Vp,) =0, (3.61) 


0p, 
5; = (A — Bs) Op — kop, By). (3.62) 


For relatively slow changes in the densities and relatively high reaction 
rates k,, k,, the second equation is taken in the approximate quasi- 
equilibrium form in which the dp,/9dt is neglected and 


k,Ap, 


——__—_—_—_—_———., 3.63 
kB + (k,—ky) oy oo) 


Pp; = R(p,) = 
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When this relation is substituted into (3.61), we have 


Os V 9ρ, 
δὶ «1+R'(p,) ὃ OS) 
Thus the density changes propagate downstream at the rate 
= V | 
and 
k,k,AB 
R'(p,) = ———_—__, . (3.66) 
{ k,B+(k,— ky) p,} 
If the densities concerned are small this is approximately 
k,B 


The propagation speed depends on the reaction rates involved, being 
slower for substances with larger attraction toward the solid. If a mixture 
of substances is present in the fluid at the entrance of the column and the 
components have different reactions rates, they will travel down the 
column at different speeds. In this way the column can be used to separate 
the mixture into bands of the individual components. If they are also 
colored, a spectrum is formed. This is the basic process of chroma- 
tography. The nonlinear effects produce heavier concentrations at the 
beginning or end of a band depending on the sign of c’(p,). Of course, the 
nonlinear equations for a single component apply only after the separation 
has taken place. 

The shock structure and other aspects can be studied from the full 
equations 3.61 and 3.62. It is remarkable in this case that the full equations 
can be transformed (exactly) into a linear equation. This was shown by 
Thomas (1944). First, a moving coordinate system 


x 
Ce Om a 


γ᾽ V 


is introduced; the equations then take the form 


dp, 9 
τὰ ε ᾿ τ 
(3.68) 
Op, 


δ: Mp Bp, — YPsPy- 


Sec 3.4 CHEMICAL EXCHANGE PROCESSESS 95 


The first equation is solved identically by 


Pp=Y,, ρ,Ξ- Wy, (3.69) 
and the second equation provides an equation for Ψ: 
Vo, tap, + By, + γψοψ, =0. (3.70) 


If we now make the nonlinear transformation 


γψ = logx, (3.71) 
we deduce 


Xor + aX, + BX, =9; (3.72) 


the nonlinear transformation eliminates the nonlinear term. In terms of the 
original variables the transformation is 


1X KEV 
=-—, »p=--i—, 3.73 
OF χ " YX {5.0} 
ΧΩ Ὲ ἤχου (a+ B)x,+ BVx, ΞΟ. (3.74) 


The linear equation can be solved in general by transform methods so 
that in this case the solutions of the approximate equation (3.64), including 
shocks when necessary, may be compared in detail with the exact solution. 
This has been investigated extensively by Goldstein (1953) and Goldstein 
and Murray (1959). The exact solution endorses the views and methods for 
including discontinuous shocks in solutions of (3.64). The details are not 
given here since Burgers’ equation is simpler to deal with and provides the 
same endorsement. Some of the relevant analysis will appear in Chapter 10 
in a different connection. 

For exchange processes, a=k,A and B=k,B are both positive. For 
these signs, the uniform state is always stable. This may be checked by 
considering perturbations in (3.74). We note that the lower order waves 
travel at a speed 


BV 


Ci > 7 AS 
9. at+B 


while the wave speeds given by the higher order terms are c_ =O and 
c,=V. Thus for a>0, 8 >0, the stability criterion c_ <cCy<c, 18 satis- 
fied. 


CHAPTER 4 


Burgers’ Equation 


The simplest equation combining both nonlinear propagation effects 
and diffusive effects is Burgers’ equation 


C, + CC, = VC,,. (4.1) 
In (2.28) we saw that this is an exact equation for waves described by 


p,+4,=0, gq=Q(p)—>»p,, (4.2) 


in the case that Q(p) is a quadratic function of p. In general, if the two 
effects are important in a problem, there is usually some way of extracting 
(4.1) either as a precise approximation or as a useful basis for rough 
estimates. 

For a general Q(p) in (4.2), for example, the equation may be written 


c,+ cc, = ve,,.— ve" (p)p2, (4.3) 


where c(p)= Q’(p) as usual. The ratio of vc”(p)p2 to vc,, is of the order of 
the amplitude of the disturbance, and we therefore expect that (4.1) is a 
good approximation for small amplitude. We are then assuming that 
omission of this particular small amplitude term does not produce ac- 
cumulating errors (as t—>00, say) which eventually lead to nonuniform 
validity. We know, in contrast, that to linearize the left hand side by 
C, + ¢oc,, where cy is some constant unperturbed value, would be disastrous 
in this respect. But as a check, we may verify that in the shock structure 
solution (see Section 2.4), where the diffusion terms are greatest, the term 
ve"(p)p2 remains of smaller order than νος in the strength of the shock. 
This kind of argument can be made the basis of formal perturbation 
expansions in terms of appropriate precisely defined small parameters. On 
the other hand, the fact that the terms retained in (4.1) represent identifi- 
able and important phenomena, whereas the term vc”(p)o2 appears more as 
a mathematical nuisance, leads one to suggest (4.1) as a useful overall 
description even beyond the range of strict validity. 
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In a similar fashion, Burgers’ equation is relevant in higher order 
systems such as (3.2)-(3.3), when nonlinear propagation is combined with 
diffusion. Of course it is limited to the stable range and to parts of the 
solution where the lower order waves are dominant. The appropriate form 
is easily recognized and again can usually be substantiated by more formal 
procedures. In the case of (3.2)+(3.3), we know from (3.6) that the effective 
diffusivity is v»*=»—(v 9—c,)*r and we would use (4.1) with this value. 
Indeed, (3.6) is the fully linearized Burgers’ equation for this system. 

Our general purpose now is to show that the exact solution of Burgers’ 
equation endorses the ideas regarding shocks that were developed in 
Chapter 2. That is, we want to confirm that as »y—0 (in appropriate 
dimensionless form) the solutions of (4.1) reduce to solutions of 


c, + cc, =0, (4.4) 


with discontinuous shocks which satisfy 
U=+ (e+e), COU > ey (4.5) 


and the shocks are located at the positions determined in Section 2.8. 


4.1 The Cole-Hopf Transformation 


Independently, Cole (1951) and Hopf (1950) noted the remarkable 
result that (4.1) may be reduced to the linear heat equation by the 
nonlinear transformation 


Px 
c= --2ν---. 4.6 
(4.6) 


This is similar to Thomas’ earlier transformation of the exchange equations 
described in Section 3.4. It is again convenient to do the transformation in 
two steps. First introduce 


c=y,, 
so that (4.1) may be integrated to 


Ι 
Y, + a= VW xx: 


Then introduce 


= —2vlogp 
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to obtain 
P=? Pex: (4.7) 


The nonlinear transformation just eliminates the nonlinear term. The 
general solution of the heat equation (4.7) is well known and can be 
handled by a variety of methods. 

The basic problem considered in Chapter 2 is the initial value prob- 
lem: 


c= F(x) at ¢=0. 
This transforms through (4.6) into the initial value problem 


p= 0(x)=exp | ~ ὁ. [°F(n) dn], t=0, (4.8) 


for the heat equation. The solution for ¢ 1s 


Ι ᾿ (x-n)° 
Tig [ een | - Sa (4.9) 


Therefore, from (4.6), the solution for c is 


oe xX-1N Gp 
'd 
; cr i 


c(x,t)= ao τπξοος --- (4.10) 
[ ε΄ “νη 
where 
η (x—n)° 
G(nsxt)= { F(') dn’ + ------. (4.11) 


4.2 Behavior as v-—>0 


The behavior of the exact solution (4.10) is now considered as v->0 
while x,¢ and F(x) are held fixed. [Strictly speaking this means we consider 
a family of solutions with »y=e, and take the limit as «0, holding 
Vo, x,t, F(x) fixed.] As v0, the dominant contributions to the integrals in 
(4.10) come from the neighborhood of the stationary points of G. A 
stationary point is where 


0G x—7 
πάσα στ" ἔππαας, --- ΞΞ ᾿ 4.12 
ἡ F(a) 0 (4.12) 


ΐ 
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Let ἡ = &(x,t) be such a point; that is, ξίχ,1) is defined as a solution of 


sa ξ) 


Ἐ(ξ)-:Ξ =0. (4.13) 


The contribution from the neighborhood of a stationary point, n=, in an 
integral 


οο 
[π()ε σα, 
— 0 


FA as ~~ GE) f2P: 
View oo" 


this is the standard formula of the method of steepest descents. 
Suppose first that there is only one stationary point ξίχ, ἢ) which 
satisfies (4.13). Then 


iS 


and in (4.10) we have 


ew =k (4.16) 


where &(x,t) is defined by (4.13). This asymptotic solution may be rewrit- 
ten 


c= F(£) 
x=E+ F(E)t. 


_ It is exactly the solution of (4.4) which was discussed in (2.5) and (2.6); the 
stationary point ξ(χ, ἢ) becomes the characteristic variable. 

However, we saw that in some cases (4.17) gives a multivalued 
solution after a sufficient time, and discontinuities must be introduced. Yet 
the solution (4.10) for Burgers’ equation is clearly single-valued and 
continuous for all t. The explanation is that when this stage is reached 
there are two stationary points that satisfy (4.13), and the foregoing 
analysis of the asymptotic behavior requires modification. If the two 


(4.17) 
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stationary points are denoted by &, and ἔξ, with €,>€,, there will be 
contributions as shown in (4.14) and (4.15) from both &, and &,. Therefore 
the dominant behavior will be included if we take 


x — ” _ v χ -- ξ) " - - ν 
——|G (é, )|7 1/2, ~ GEE, )/2v ει Pk IG (é,)| 172, — GCE )/2 


[σ΄ (€,)|7 1/29 — GE) /2” 4. 1G" (E,)|7 1/29 σῶν (4.18) 
When G(é,)#G(§,), the accentuation by the small denominator ν in the 
exponents makes one or the other of the terms overwhelmingly large as 
v0. If G(E,)< G(&,), we have 


if G(E\) > GE,), 


x—& 


In each case (4.17) applies with either €, or € for €. But the choice is now 
unambiguous. Both é, and &, are functions of (χ,ἢ); the criterion G(,) = 
G(§,) will determine the appropriate choice of €, or ἔξ; for given (χ, ἢ). The 
changeover from &, to & will occur at those (χ, ἢ) for which 


G(§,) = G(&). 
From (4.11), this is when 


(x— ( τεῦ" 


[τα ') dn! + ΠΣ F(n') a+". (4.9) 


Since €, and &, both satisfy (4.13), the condition may be written 


+ { F(,) + F(£)} (ξι -&) = [ “F(a (4.20) 


This is exactly the shock determination obtained in (2.45). The changeover 
in the choice of terms in (4.18) leads to the discontinuity in c(x,?t) in the 
limit yp>0. All the details of Section 2.8 can be confirmed similarly. We 
conclude that solutions of Burgers’ equation approach those described by 
(4.4) and (4.5) as v0. 

In reality ν is fixed, but it is relatively small and we expect that the 
limit solution for v—-0 will often be a good approximation. For this 
argument, since ν is a dimensional quantity, we have to introduce a 
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nondimensional measure of » by comparing it with some other quantity of 
the same dimension. This is not hard to do. In the single hump problem, 
for example, where F(x) is as shown in Fig. 2.9, we may introduce the 
parameter 


A= [ {F(x)~¢o} ae. (4.21) 


The dimensions of A and ν are both L?/T, so that 


A 
R= (4.22) 


is a dimensionless number, and by “yv small” we mean R>1. If the length 
of the hump is L, the number R measures the ratio of the nonlinear term 
(c -- ορ)ς, to the diffusion term vc,.., in those regions where the x scale for the 
derivatives is L. (Inside shocks, for example, the x scale is of smaller order.) 
It will be convenient to refer to R as the Reynolds number, following the 
practice in viscous flow. 

Even with the meaning of “small v” decided, there are distinctions 
between the limit solution ν-- Ὁ and the solution for fixed small ν. As we 
saw in (2.26), the shock thickness tends to infinity if the strength tends to 
zero. Therefore for fixed R, even if it is large, any solution that includes 
shock formation or a shock decaying as t—oo will not always be well 
approximated by the discontinuity theory in these regions. As regards a 
shock formation region, the precise details are not usually important; one 
just wants a good estimate of where it forms, without details of the profile, 
and this is provided by the discontinuity theory. The effects of diffusion on 
decaying shocks as f->o0o is of more interest. We will explore these 
questions through typical examples in the following sections. 


4.3 Shock Structure 


The shock structure for (4.1) satisfies 


— Ucy + σον = vcyy, X=x— Ut. 
Hence 
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and the equation may be written 


(c—c,)(¢,—¢) = --2νοχ. 
The solution is 


this agrees with (2.25), since c=2ap+ β for quadratic Q(p). Solving for c, 
we have 


Cy— ec Cc, +e 
c=¢c, + τ 5 "΄'΄' .νε- 5 (4.23) 


Cy—C 
1+exp τ '(χ- Ut) : 


One can study how an initial step diffuses into this steady profile by 
taking 


ὯΝ C1 x >0, 
C,>c, χέθ, 


in (4.10)-(4.11). The solution may be put in the form 


Cy—C Ο, τῷ 
Ξε οι Ὁ τ-- ]'[Πὶ'ο με 353 (4,24) 


ζυΞς ᾿ 
1+hexp a ~(x— Ut) : 


where 


eo 2 
[ως 8 
h= ΞΞ fan)? laid Ξ-- -- (4.25) 
J e~ ἐξ 
(x -- cpt) / Vani 


For fixed x/t in the- range c,<x/t<c,, h-1 as too, and the solution 
approaches (4.23). 
4.4 Single Hump 


A special solution with a single hump may be obtained by taking 
F(x) =c)t+ A8(x) (4.26) 


as the initial condition. The parameter A agrees with (4.21) and the 
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Reynolds number is R= A/2v. The constant cy may be omitted without 
loss of generality, since the substitution 


C=Cyt+é, χορ +X (4.27) 
in Burgers’ equation reduces it to 
C, + 66, = vez; (4.28) 


Thus omission of cy is equivalent to viewing the solution from a frame of 
reference moving with velocity cy. Accordingly we consider only 


F(x) = A8(x). (4.29) 


The lower limit in the integral in (4.11) is arbitrary since it cancels out 
in (4.10). Therefore we may choose it to be 0+ and include the 6 function 
for ἡ «0 but not for 7 >0. Then 


(x-n)° 


21. 
(χ -ῇ 
21 


n > 0, 
δε: 


A, ης-0. 


The integrals in the numerator of (4.10) may be evaluated and those in the 
denominator written in terms of the complementary error function. The 
result is 


οἈ--] ewe [aut 
(x1) = coer cts See R=. (4,30) 


Va +(e®-1 “Pay 
7 +(e 7 ee ¢ 


The similarity form of the solution, that is, 
V 
c= V ; ἡ 


could have been predicted by dimensional arguments. The only dimen- 
sional parameters in the problem, A and ν, both have dimensions L?/T; 
there is no separate length and time with which to scale x and ¢ separately. 

As R->0 we would expect the diffusion to dominate over the non- 


linearity. For R<1 the denominator in (4.30) is Vz + O(R), uniformly in 


=; 4) 
vit »} 
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x,t,v; hence c may be approximated by 


c(x,t) =) Ξ Re~*/4 
A 


=e 4" (4.31) 
VAnvt 
This is the source solution of the heat equation c,= ΡῈ...» 
is verified. 
To discuss the behavior for large R it is convenient to introduce the 


similarity variable z=x/V2At and to write (4.30) as 


c= 4 g(z,R), 


so the expectation 


We now discuss the behavior of g as Roo for different ranges of z. In all 
cases, e®— 1 may be approximated by e* and we may use 


Ι e RU 27) 
SS re (4.33) 
2VR Va +eR/ ed 
zVR 
If z «0, the integral tends to 
[Ue Pab=ve 


— ho 
therefore g->0 at least like 1/VR . If z >0, the integral becomes small and 
we use the asymptotic expansion 


2 


οο δ] 
— ¢? e€ 
es a&w~ as ἡ-- 00. 
[ 2η 


Therefore 


Ζ 


Po an pea oR z>0, R>o. (4.34) 
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If 0<z< 1, we have 
g~zZ, 0<z<l1, R->o, (4.35) 


whereas if z>1, g-0 as Roo. Thus g-0 except in 0<z< 1, and in that 
range g~z. In the original variables, the result reads 


i ind<x<V2At, 
C~ 
0 outside. 


This is the appropriate solution of (4.4) with a shock at x=V2Az. The 


Shock velocity is U= VA/2t , and c jumps from zero to V2A/t, so the 
shock condition (4.5) is satisfied. 

The same expression (with cy=0 to fit our assumption here) appeared 
in (2.52) for the ultimate behavior of the solution of (4.4) for a general 
single hump. That was asymptotic in a different sense; it was the behavior 
as too within the description provided by (4.4). For a ὃ function initial 
condition, it is valid immediately. 

The shock is located at z=1, and for large but finite R (4.34) shows a 
rapid transition from exponentially smalt values in z>1 to g~z inz<1. In 
the transition layer z~1, (4.34) may be approximated by 


] 
es 4.36 
᾿ Ι-νπᾷκ τ") ( ᾿ 


In the original variables this would give 


24 
ΐ 


CH 


(4.37) 


Iter | ην Ξ4 (x—-V2At )+ 405284 | 


It agrees with the shock protue (4.23), with c,—c,=V2A/t and the shock 
located at x= ν24: to first order. From (4.36), the transition layer is of 
thickness O(R ἢ) around z=1. 

There is another (weaker) transition layer at z=0 to smooth out the 
discontinuity in derivative between g~0 in z<0 to g~z in 0<z< 1. It is 
clear from (4.33) that this transition layer occurs for 


z=O(R7'/*), 
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and for these values (4.33) may be approximated by 


il 
jee |: re er, 
2VR fe Pa 
zVR 
In the original variables we have 
—x*/4vt 
cy \f/- = 
d [ eV a 
x νάνι 


(4.39) 


The form of the solution for large R is shown in Fig. 4.1, where 


g(z)=cVt/2A is plotted against z. As R->oo the shock layer becomes a 
discontinuity in c and the transition layer at x =0 becomes a discontinuity 
in c,. In the scaled variables g and z, the profile is independent of 1. 
Therefore if the value of R provided by the initial condition is large, the 
shock remains relatively thin and the discontinuity theory of (4.4) is a good 
approximation for all t. This is true even though the shock strength is 


proportional to V2A/t and tends to zero as [> 00. 


A significant point in this connection is that the area under the profile 


remains constant even with diffusion included, since 


4 [7 «ἀ.-|νο,- γος =(. 


Hence the “effective” Reynolds number defined as 
] to) 


a Ἐς 


(172 Ay”? ς 


<.—\_> «ἰοου-. 
R~V2 Ro! 


x(2 At) 72 


Fig. 4.1. Triangular wave solution of Burgers’ equation. 
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remains constant for all ¢. The next example will show that the more usual 
situation is for diffusion to take over ultimately in the final decay, and that 
the single hump is exceptional in this respect. 


4.5 N Wave 


The final examples we consider are more easily derived by choosing 
appropriate solutions for @ to satisfy the heat equation (4.7) and then 
substituting in (4.6) to obtain c. As a rough qualitative guide to the 
appropriate choice, the profile for c will be something like p,. Thus for the 
single hump we could have taken the solution of m corresponding to an 
initial step function. To obtain an N wave for c, we choose the source 
solution of the heat equation for q: 


φειτγ eT /4et. (4.40) 


From (4.6), the corresponding solution for c is 


2 t —x?/4yt 
παι ee ύμδ (4.41) 


Φ 4 1+ Va/t ex /4n 


Since φ has a ὃ function behavior as t—-0, this is a little hard to interpret as 
an initial value problem on c. However, for any t>0 it has the form shown 
in Fig. 4.2, with a positive and negative phase, and we may take the profile 
at any t=1/,>0 to be the initial profile. It should be typical of all N wave 
solutions. 


(t/2A)%2%¢ 


Rolog t/to 
: .. x(2Ar) 2 
Rolog t/to 


Fig. 4.2. N wave solution of Burgers’ equation. 
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The area under the positive phase of the profile is 


00 
0 


[cax= —2pr[logq] 
0 


= 2rlog(1 ἐγ. ) (4.42) 


The magnitude of the area of the negative phase is the same. Thus in 
marked contrast with the previous case, the area of the positive phase 
tends to zero as t—>oo. If the value of (4.42) at the initial time 10 is denoted 
by A, we may introduce a Reynolds number 


Rom τιον γι ) (4.43) 


But as time goes on the effective Reynolds number will be 


κῶς ὦ [ἐπι 1+ ἢ (4.44) 


and this tends to zero as too. If ΚΟ], we may expect the “inviscid 
theory” of (4.4)-(4.5) to be a good approximation for some time, but as 
too, R(t)—>0 and the diffusion term will eventually become dominant. 
This is different from the previous example in which the effective Reynolds 
number defined in the same way remains equal to the initial Reynolds 
number. We now verify the details. 

In terms of Ro and ty, g= to(e*°— 1)’; hence (4.41) may be written 


a ~] 
2 
x ἢ ex /4ut 
mle a : (4.45) 


For Ry>1 (corresponding to 6). it may be approximated by 


—1 
~S {14 ΕΔ ἀμ (4.46) 
ΐ \ lo 


for all x and ¢. Now for fixed ¢t and Ryo, 


| . ~V2At <x<V2Al, 
C~ 
0 |x| >V2At. 


? 
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This is exactly the inviscid solution. However, for any fixed a and ν we see 
directly from (4.41) [and it may be verified also from (4.46)] that 


c~ τ γ΄ aa ales as [> 00. (4.47) 


This is the dipole solution of the heat equation. The diffusion dominates 
the nonlinear term in the final decay. It should be remembered, though, 
that this final period of decay is for extremely large times; the inviscid 
theory is adequate for most of the interesting range. 


4.6 Periodic Wave 


A periodic solution may be obtained by taking for φ a distribution of 
heat sources spaced a distance A apart. Then 


1/2 > χ-- πλ) 
p= (4πνι) 2 > ee SF" |. (4.48) 
δ { (x—nh)/t} exp{ -- α -- πλ)᾽ ,άνε) 
ἐ- -νῖξ- Θ΄ ὁ (4.49) 
ὺ Σ expt - (α-- πὰ) /4vt } 


When λ΄ Δάν)» 1, the exponential with the minimum value of (x — nd) /4vt 
will dominate over all the others. Therefore the term with n=m will 
dominate for (m— 3)A<x<(m+}4)A, and (4.49) is approximately 


cw πα, (m—5)A<x<(m+5)a 
This is a sawtooth wave with a periodic set of shocks a distance A apart, 
and c jumps from —A/2t to A/2t at each shock. The result agrees with 
(2.56). 

To study the final decay, A*/4vt<1, we may use an alternative form 


of the solution. The expression (4.48) is periodic in x, and in the interval 
—A/2<x<X/2, 


g—8(x) as t0. 
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The initial condition can be expanded in a Fourier series as 


- ~ 2 πῆχ 
Φ(χ)Ξ x [142 cos — 
and the corresponding solution of the heat equation for ῳ is 
φΞ - WS exp] — 4nin’ vt |cos en (4.50) 
λ : r2 λ 


It may be verified directly that this is the Fourier series of (4.48). In this 
form 


When vt /A?>>1, the term with n=1 dominates the series and we have 


ow om oe ἘΞ Anvt εἶν 2πχ 
Ee. \? A 


(4.52) 


This is a solution of c,=vc,,, and again the diffusion dominates in the 
ultimate decay. 


4.7 Confluence of Shocks 


When a shock overtakes another shock, they merge into a single shock 
of increased strength as described for the inviscid solution (ν--90) on the F 
curve in Fig 2.16. It is possible to give a simple solution of Burgers’ 
equation that describes this process for arbitrary v. 

The solution for a single shock is given in (4.23) and the correspond- 
ing expression for φ may be written in the form 


Cx ct 
φερε»  f=exp|-5-+ 457-5). (4.53) 


In (4.23), the parameters b,, b, which locate the initial position of the 
shock are taken to be zero. The expressions ἢ,» f, are clearly solutions of 
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the heat equation (4.7). The expression for c is 


(4.54) 


For c,>c,, f,; dominates as x->+ 00 and we have c->c,; f, dominates as 
χ- στοῦ to give c->c,. The center of the shock is where f,=/,, that is, 
x=4(C,+¢,)0. 

Now since any f, is a solution of the heat equation, we may clearly 
add further terms in (4.53) and generate more general solutions of Burgers’ 


equation. Such solutions represent interacting shocks. We consider the case 


C3 — C2 
p=fithtfs, b, = b,=9, b= ay 
(4.55) 
Cite +C 
- Ohi Ὁ ohrt 635 2. 3.5. ΡΦΡΟΡΌ. 
fithtf, 


If ν is reasonably small, we can recognize shock transitions between the 
States Cc), C), c; by noting in which regions the corresponding f dominates. 
At t=0, ἢ dominates in 0<x, f, in -—1<x<0, f, in x < —1. Thus we have 
a shock transition from c, to c, centered at x =0, and one from c, to c, 
centered at x = —1. For t>0, the regions in which c™c,, c™c,, c™c, can 
be found in the same way and the result is shown in Fig 4.3. For early 
times the transition from c, to c, occurs where ἢ =f, on 


Cc, + 
ΕΞ or; (4.56) 


Fig. 4.3. Merging shocks. 
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the transition from c, to c, occurs for f,=/, on 


Car iC 
x= 5 Ξ) Ξε]; (4.57) 


Since 3(c,+c3)>4(c,+¢,), the second shock overtakes the first at the 
point (x*, ἢ) determined by (4.56) and (4.57). At this point 


fi=h=h- 


For t>¢*, there is no longer any region where f, dominates and the 
continuing solution describes a single shock transition between c, and c;, 
moving with velocity 4(c,+c3) on the path 


Ci +c, 
χ-- χἕξ 5 (1-- Ὁ) (4.58) 
determined by ἢ = fs. 


CHAPTER 5 


Hyperbolic Systems 


The next step in the development of the theory of hyperbolic waves is 
to see how the ideas and methods established so far can be extended and 
amplified for the study of higher order systems. Some preliminary remarks 
have been made in the discussion of the various modifying effects on a 
single basic wave motion, but the questions relating directly to the possi- 
bilities of a number of different wave modes in a system have been 
touched upon only in passing. We now enter into the general discussion of 
these questions. 

Many physical problems lead to the formulation of a quasi-linear 
system of first order equations; such equations are linear in the first 
derivatives of the dependent variables, but the coefficients may be func- 
tions of the dependent variables. When these equations describe wave 
motion, a good understanding of many of the issues can be developed 
from the study of plane waves. Accerdingly, we start with the case of two 
independent variables. The two variables are often the time and one space 
variable so we denote them by ¢ and x and use corresponding terminology, 
but the discussion applies to any two-variable system. If the dependent 
variables are u,(x,t), i=1,...,n, the general quasi-linear first order system 
is 


ou, ou, = 
vay + Dy + =9 P= 1,°°° 4A, (5.1) 


where the matrices A, a, and the vector b may be functions of u,,...,u,, as 
well as x and ¢. (Here and throughout summation over a repeated subscript 
is automatic unless otherwise indicated.) 

In this chapter, we establish the conditions for (5.1) to be hyperbolic 
and discuss some of the general consequences. Some brief comments on 
the situation for more space dimensions are made, but the case of more 
dimensions is developed primarily in the context of specific problems and 
considerable use is made of the fact that any small region of a two or three 
dimensional wave behaves Jocally as if it were plane. 
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5.1 Characteristics and Classification 


The key to the solution of a single first order equation as described in 
Chapter 2 was the use of the family of characteristic curves in the (x,?) 
plane; along each characteristic curve the partial differential equation 
could be reduced to an ordinary differential equation. In some cases the 
solution could then be found analytically. But at worst the partial differen- 
tial equation could be reduced to a set of ordinary differential equations 
for stepwise numerical integration. In either event, one could proceed to 
build up the solution by successive “local” considerations of small regions; 
the whole solution did not need to be calculated at once. This, of course, 
corresponds to the simple ideas of wave phenomena; in any small time 
increment the behavior at a point can be influenced only by points near 
enough for their waves to arrive in time. For the system (5.1), we ask 
whether such local calculations are possible. If they are, the system is 
hyperbolic and a suitable precise definition will be framed. 

In general, any one of the equations in (5.1) has different combina- 
tions of du,/dt and du,/dx for each τ. That is, it couples information 
about the rates of change of the different u;in different directions; one 
cannot deduce information about the increments of all the u, for a step in 
any single direction. But we are at liberty to manipulate the n equations in 
(5.1) to see whether this information can be obtained from some combina- 
tion of them. We therefore consider the linear combination 


du, du, 
{πὲ +a ay ae) +h 0, (5.2) 


where the vector I is a function of x,t,u, and investigate whether I can be 
Chosen so that (5.2) takes the form 


du, Ou; 
m( Bae +a52)+48,=0 (5.3) 


If this is possible, (5.3) provides a relation between the directional deriva- 
tives of all the u, in the single direction (a, 8). When this is the case, it will 
be valuable to introduce curves in the (x,t) plane defined by the vector 
field (α,). If x=X(n), t=T(n) is the parametric representation of a 
typical member of this family, the total derivative of u i; on the curve is 
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Without loss of generality, we may take 


a=X'(n), B=T'(n), 
and write (5.3) as 


du, 
age κω (5.4) 


The conditions for (5.2) to be in the form (5.4) are 
LAy= m; 1 lai, ms γι, Χ', 
and we may eliminate the γι, to give 


1,(A,X’—a,T’) =0. (5.5) 


These are n equations for the multipliers /, and the direction (X’, T’). Since 
they are homogeneous in the /,, a necessary and sufficient condition for a 
nontrivial solution is that the determinant 


|A,X’—a,T’|=0. (5.6) 


This is a condition on the direction of the curve. Such a curve is said to be 
a characteristic and the corresponding equation (5.4) is said to be in 
characteristic form. 

Each equation in characteristic form provides only one relation be- 
tween the n derivatives of u, along the corresponding characteristic curve. 
To proceed with a local construction of the solution in some small region, 
we shall see below that n independent equations in characteristic form are 
needed. This is the basis for the definition of a hyperbolic system. 

First, however, a relatively mild but important restriction on the 
systems to be included in the definition must be noted. The restriction 
concerns the coefficient matrices A and a. It is not hard to see that either 
one, or even both, may be singular in simple situations. If the determinant 
|A,|=0, then 7T’=0 is a solution of (5.6) and the x direction is 
characteristic; if |a,|=0, then X’=0 is a solution and the ¢ direction is 
characteristic. Clearly, it may be acceptable for the axes to be characteris- 
tics, and these possibilities should be included in the discussion. Yet in 
some cases where both matrices are singular, the systems are so degenerate 
that they must be excluded. The two situations can be distinguished by 
checking whether a rotation of the axes cures the difficulty or not. If the 
trouble is merely that the original axes coincide with characteristics, a 
rotation of the axes will lead to a new system with nonsingular matrices. A 
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rotation of the axes in (5.1) replaces the original matrices by a linear 
combination of them. Therefore the appropriate condition is that 


JAA, + .a;,| 0, (5.7) 


for some A, μ, not both zero, and we do not need to carry out the 
transformation explicitly. If (5.7) can never be satisfied, we have the 
degenerate case that must be excluded. In that case, all directions are 
formally characteristic and the discussion is spurious. On the basis of 
examples given in the next section, it appears that systems which are so 
degenerate are unnecessarily large and can be reduced to smaller systems 
with coefficients that satisfy (5.7). 
With this restriction the following definition is introduced. 


Definition. A system (5.1), satisfying (5.7), is hyperbolic if n linearly 
independent real vectors I, k=1,...,n, can be found such that 


IO) A, =a, 8 —() (58) 


for each k, and the corresponding directions {a,8) are real with 
ας. BY 40, 

It should be noted that the emphasis is on there being n independent 
vectors I“, and it is not important that the corresponding directions 
(a, 8B) be distinct. If the directions are distinct so that there are n 
different families of characteristics, the system is said to be totally hyper- 
bolic; but we shall have little use for this term. As we shall see below it is 
possible for (5.6) to have less than n different solutions and yet n inde- 
pendent vectors 1 can be found. 


Special Case A, =6,,. 


t 
In many problems, the system (5.1) appears in the special form 
du, 
a, +49, + Fi =9, (5.9) 


where the A matrix is the unit matrix. In other cases one could transform 
the system to this form by multiplying by A7~', after a change of 
coordinates if the original matrix A is singular. It is seldom worthwhile to 
do this reduction in detail, but we may, when convenient, refer to this form 
without loss of generality. It is clear from (5.6) that in this form 7’+0, so 
characteristic curves will never have the direction of the x axis. We can 
therefore parametrize a characteristic curve by ¢ itself and describe the 
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curve by x= X(f). The linear combination 


Ou; Ou; 
Lhd 


lop ag ὁ δὰ 


takes the characteristic form 


Leo) on “ae (5.10) 
provided that 


La, = lc. (5.11) 


In particular, the characteristic velocity c must satisfy 


la, — c8,|=0. (5.12) 


The possible roots c are the eigenvalues of the matrix and the vectors 1 are 
the left eigenvectors. 


_ Two results follow from standard theorems in linear algebra: 


The eigenvectors 1 corresponding to different eigenvalues c are 


linearly independent. Hence the system is hyperbolic if (5.12) has n different 
real roots c. 


If a, is a real symmetric matrix, then all the roots of (5.12) are real 
and n independent real eigenvectors can be found. Hence the system is 
hyperbolic if a is real and symmetric. 


5.2 Examples of Classification 


Before proceeding with the use of the equations in characteristic form 
and with the further properties of characteristics, a few examples will 
illustrate the ideas and show some of the peculiarities that may arise in the 
classification of systems. 


Example 1. First consider the wave equation 


This can be written as a system by introducing u,=v, u,=w and writing 
v, — w, =), 


w,— yo, =0. 
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The linear combination 

1, (v,-w,) +1,(, — yo,.) =0 
takes the characteristic form 


1, (0, + cv,) +1h,(w,+ cw,) =0, 
provided 


There are nontrivial solutions when c*= y. If y >0, we may take 
c=+Vy, L=-Vy, l,=1; 
c=—Vy, L=+vVy, l,=1. 
The two vectors I are linearly independent, hence the system is hyperbolic. 
If y «0 there are no real characteristic forms; in fact, the equation is the 


prototype of elliptic equations. 


Example 2. The heat equation 


u, — Uu,, =0 

is equivalent to the system 
u,— v, =0, 
u,—v=0. 


It is clear that the combination 

I (u,—v,)+h(u,— v0) =0 
can be in characteristic form only if /,=0. Thus the only solution is 
1=(0, 1) or a scalar multiple of this. Since there is only one vector I for the 


second order system, it is not hyperbolic. If we check the general formal- 
ism, (5.6) reduces this case to 


=0, that is, T’*=0. 


aot tes 0 


Thus the x axis is a double characteristic, but there is only one 
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characteristic form for it: 


u,—v=0. 


Example 3. The simplest second order hyperbolic equation is 


xt 


an equivalent system is 


In this case both matrices A and a are singular but (5.7) is satisfied and 
there is no trouble. Equation 5.6 1s 


xX’ 0 
0 ae 


=(), thatis, X’T’=0. 


Both the ¢ axis and x axis are characteristics, and the original equations are 
already in characteristic form. 


Example 4. Consider now 
U,,— Yu, tu=0. 


If we introduce u,=v, u,=w, as in Example 1, the extra undifferentiated 
term in u prevents the completely obvious elimination of u and might 
suggest keeping three equations. If we choose 


u,—v=0, 
u,—w=0, 


w,— yo, tu=0, 


as an equivalent system, we have the trouble that both matrices A and a 
are singular, and so are all linear combinations of them. Equation 5.6 15 


—T’ 0 0 
xX’ 0 0 [Ξ0 
0 yT' x’ 
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and 15 clearly satisfied for all values of (X’,7’). However, this system is 
excluded by (5.7). 

At least in the case y>0, we can implement the suggestion noted 
earlier that the system is probably too big and can be reduced. We can 
spot the reduction by writing the equation as 


0 0 0 0 
οι Vg) Cea ee Ne = - 
& γ τς {Ὁ Ὁ yy jer ω 
Then the introduction of 
p=u,+Vy u, 


leads to the second order system 


This has nonsingular coefficients. In fact it is already in characteristic 
form, and there are just two characteristics. 


Example 5. An alternative system that might be proposed for the 
equation 


amas) ta u=0Q 
iS 


u, —w=0, 


γνν,-- yo, tu=0. 


This differs from Example 4 in that the equation v,— w,=0, obtained by 
eliminating u, has been substituted for u, —v=0. Now A is the unit matrix 
and we should have no trouble. The condition (5.6) is found to be 


xX’ 0 0 
0 x’ τ΄ |=90, thatis, X’(X’?—yT”) ΞΟ. 
0 γ΄ xX’ 


Two of the roots X’= + Vy T’ are clearly the characteristics of the original 
equation, but why has an extra characteristic X’=0 arisen? The system is 
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not too large as a system, but it is no longer equivalent to the original 
equation. It is in fact equivalent to 


0 
5p (Mie Yexx + u) =0. 
The extra characteristic corresponds to the extra ¢ derivative. 


Example 6. The system 
u,+ C(u,v)u, =0, 


v,+ C(u,v) vo, = 4, 


is clearly an example with one characteristic on which dx /dt=C, but with 
two independent characteristic forms. Hence it is hyperbolic. 


Example 7. The system 
u,+ C(u)u, =0, 


v,+ C(u)v, + C’(u) vu, =0, 


occurs in dispersive waves. The only possible characteristic form is the first 
equation as it stands. Hence the system is not hyperbolic. Yet because of 
the exceptional case that the first equation can be solved independently of 
the second, we can integrate the first equation along the characteristics 
dx /dt=C. Then once μ is known in a whole region , u, can be calculated 
and the second equation can be integrated along the same characteristics 
to find v. For these purposes it is like a hyperbolic system with a double 
characteristic, yet formally it would be classified as parabolic. 

In Examples 2 to 7 the classification is not completely straightforward. 
We now add a few nonlinear examples where there is no problem in the 
classification but they are typical and rather well known. We list pertinent 
information with a minimum of explanation. 


Example 8: Gas Dynamics. In the compressible inviscid flow of gas 


with velocity u, pressure p, density p, and entropy S, the equations (see 
Chapter 6) are 


p, + up, + pu, =, 
l 
u, + uu, + παν 


S,+uS, =0, 
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where p= p(p,S). The characteristic equations are 


dp du ax _ 
Were ay =(0 n A 
2 56 on & wi 
dt at ὶ 


where αὖτπο (ὃρ ,8ρ)ς.- constant: FOr ἃ gas with constant specific heats 
p=k«p%e*/® and a?=yp/p. 


Example 9: River Waves and Shallow Water Theory. ‘The equations 
were given in (3.37); the characteristic forms are 


2 
© (vt2Vg'h )=8/S— Ο- on Says Veh. 


Example 9’. The reduced kinematic approximation (3.38) has a 
single characteristic form 


ὦ 1/2 
dh _ dx _ 3,8» 1/2 
at at 3( an 


Example 10: Magnetogasdynamics. For a conducting gas in a mag- 
netic field the equations (using standard notation) are sometimes taken to 


be 
p, + up, + pu, = 0, 
p(u, + uu) +p, =JB, 
y-1 P; P y—1p ρ, x δ᾽ 
Ι ᾿ 
€,E,+ —B,+j=0, 
μ 
where ἡ Ξσί( -- ΜΒ). The characteristic velocities are  (ερ μ) 142, uta,u. 


Example 10’. When the conductivity σ is very high the following 
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reduced set may be derived and is often used as an adequate approxima- 
tion: 


p, + up, + pu, =0, 


B,+ uB, + Bu, =0, 
p(u,+uu,) +p, + (BB, =0 


1 Y Pp Ε 
yay] (Pet wx) γ--] πον νὼ 


The characteristic velocities are now u+(a*+ B*/pp)'/2, u, u. 
Example 11: Nonlinear Effects for Electromagnetic Waves. Ina simple 


but probably unrealistic formulation of the effects in nonlinear optics, one 
may take 


dx 
---- = + 
dt ~ C(E) dt On dt Ὁ) 


where c(E)= { uD’ (E)}~'/?. Dispersive effects usually make the relation 
D= D(E) inadequate. 


Example 12: Nonlinear Elastic Waves ina Bar. The one dimensional 
equation for waves in a bar may be formulated in terms of the displace- 
ment &(x,7) of a section initially at position x and the stress o(x,1?), as 


Posi = Ox» 


where py is the initial density in the unstrained state. If the strain e=£&, and 
velocity u=€ are introduced, the equivalent pair 
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may be used. The linear theory takes oxe, but nonlinear effects may be 
included by taking o as a more general function o = o(e). The characteristic 
velocities are + {0’(€)/p9}!/*. For the appropriate choices of o(e), there are 
interesting effects in the wave propagation; in particular, it is a little 
surprising perhaps to find that shocks are produced in the unloading phase 
of a disturbance. An account is included in Courant and Friedrichs (1948, 
p. 235). 


5.3 Riemann Invariants 


Each equation in characteristic form introduces a particular linear 
combination of the derivatives. For simplicity we consider the reduced 
form (5.10), where the linear combination concerned is /,du,/ dt. In a linear 
problem, the vector I is independent of u so that a new variable r=lu, 
simplifies the form of the equation to 


dr τ 
ὩΣ + f(x,t,u) =(0. 


In nonlinear problems, however, | may depend on u and it is not always 
possible to achieve this form. It would be necessary to find A and r such 
that 


ἰ ἄμ; Ξε ὰ ἄγ, 
or, equivalently, | 
or 


(Here x and ¢ are held fixed; the differential dr refers only to changes in u.) 
This is a special case of Pfaff’s problem for the integrability of differential 
forms. For n=2 we may eliminate r and find an equation for A which 
clearly has a solution. For n >2, however, elimination of both A and r from 
(5.13) gives conditions on the /, which must be satisfied for this to be 
possible. 

For a hyperbolic system, the ” characteristic equations take a particu- 
larly simple form if it should turn out that a variable r, can be introduced 
corresponding to each differential form /,“du,. Then the functions r, can 
be used as new variables in place of the u, and the characteristic equations 
can be written as 


dr, dx 
Gi thestr)=0 on T= ¢,(x,1,4). (5.14) 
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This can always be done for linear problems, and in that case the f, are 
linear in r. For nonlinear problems it can be done when n=2, but it may 
not be possible for n>2. 

Such variables were introduced by Riemann in his work on plane 
waves in gas dynamics, a case with n=2. In that particular case (see 
Section 6.7), the f, are zero so that r, and r, are constant on their 
respective characteristics; the functions r, and r, are then called Riemann 
invariants. In general, we might call the r, Riemann variables. 


5.4 Stepwise Integration Using Characteristics 


Insight into the structure of solutions of hyperbolic equations, such as 
the correct number of boundary conditions and the domain of depen- 
dence, can be obtained by imagining a construction of the solution at 
successive small time increments. For simplicity, it will be assumed that the 
characteristic equations can be put in the form (5.14), but the qualitative 
features apply to the general case. 

Consider the mixed initial and boundary value problem in x >0, ¢>0, 
with data prescribed on x =0 and +=0. If we take any point P in the (x,/) 
plane, and if Q, is a neighboring point on the kth characteristic through P, 
(5.14) can be approximated to first order by 


τ. (Ρ)-- (ΘΟ. ) +h.(Q,){t(P) — t(Q,) } =9, (5.15) 
x(P)-x(Q,)=¢,(0,) {t(P)-—t(Q,)}, (5.16) 


with an obvious notation for the values of quantities at P and at Q,. If 
these relations are used for all n characteristics, and if the values at the Q, 
are all known, (5.15) gives n equations for the r, at P. If some of the c, are 
the same, some of the Q, will coincide, but this does not matter provided 
there are the full complement of n different equations (5.15). 

We now use this construction repeatedly, as shown in Fig. 5.1. The 
sketch is for three characteristics, with c, >c,>0>c 3. Take first the point 
P at the first time increment ¢(P)=At. The point is taken with sufficiently 
large x(P) for the characteristics PQ,,PQ,, PQ, to intersect the positive x 
axis as shown. In all cases 1(Q,)=0. If all the r, are known initially as 
functions of x for t=0, then each c, is known as a function of x; hence for 
a chosen x(P), (5.16) determines x(Q,) for each k. Then r,(Q,),f,(Q,) are 
calculated from the initial values of r,, and (5.15) determines r,(P). This 
can be repeated for various points P on the line ἐξξ Δί provided that they 
are to the right of the point W, defined by the characteristic segment OW 
through the origin with the fastest velocity c,. 
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0 q3 Py Q, Qe Q3 P32 Xx 


Fig. 5.1. Stepwise construction of solutions using characteristics. 


These calculations can be repeated at successive time steps. For 
example, at P’ the values of r, are determined in terms of the values 
Q;,Q23,Q3; which were determined at the previous step. Thus in principle 
the solution in the triangular region to the right of the characteristic OW 
can be determined from the given values of all the r, on t=0, x >0. It is 
also clear that the values at P’ will depend only on the data between P; 
and P; on the x axis, where P’P; and P’P, are the characteristics 
corresponding to the fastest and slowest speeds, respectively. The segment 
P,P3 is the domain of dependence of P’. The domain of dependence shows 
the wave character of the solution; signals propagate with speeds c,,c,,¢3 
and waves from the points between P; and P3; are the only ones that can 
reach P’ in time. 

For the full initial value problem, with r, given on t=0, —o<x< 0, 
it is clear that the solution can be constructed in ¢>0 and it is unique. 
Provided the solution is stable (a question that we do not go into here), this 
problem is well-posed. | 

But we return to the mixed problem with data given on t=0 only fo 
x>0O, and the remaining information made up by data on x=0, ¢>0. 
Consider a point p on t=At, but to the left of W, so that the two positive 
characteristics intersect the ¢ axis at 4,»4). The value r,(p) is still deter- 
mined from the data at qg, on the x axis. In fact, p can be taken on the 1 
axis and 7, is still determined from the data on the x axis. Thus 7, cannot 
be prescribed on x =0. To determine r,(p) and r,(p), the values of r,,72,73 
will be required at g, and q,. On x =0, r; is calculated not prescribed, but 
clearly r, and r, must be given. At later time steps this is repeated. For 
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example, at p’ the value of r, is determined from 42; r,,r, are determined 
from q',g5, and r,,r, must be given at these points. Thus the well-posed 
problem is 


Γ1» 7» 73 givenat f=0,x>0, 
Tits givenon x=0,1>0. 


The data on x=O only affect the solution to the left of the 
“wavefront” OWW’. Of course equivalent data may be posed and, in 
general,any three conditions on ¢=0 and any two conditions on x =0 will 
be correct. The only exception is that on x =0 the two conditions must not 
determine 73» since this is determined by the initial data. 

The results here can be generalized to n equations and other boundary 
values. From the stepwise construction it is clear that the number of 
boundary conditions should be equal to the number of characteristics pointing 
into the region. The direction along a characteristic must be defined in 
order to make “pointing in” unambiguous. When ¢ is time the direction 
usually is chosen as ¢ increasing; but ¢ increasing, or x increasing, or even 
some function of them increasing, will all lead to well-posed problems 
provided the direction once chosen is used consistently. 

Detailed proofs of existence and uniqueness may be based on itera- 
tions of the integral equations which express the value at a point as an 
integral over its characteristic triangle (e.g., P’ P,P; for P’ in Fig. 5.1). The 
whole procedure is similar to the Picard iteration for ordinary differential 
equations. Reference may be made to Courant and Hilbert (1962, p. 476). 

In nonlinear problems the characteristic speeds c, are functions of u; 
hence the number of data posed on any boundary can change with the 
data. 


5.5 Discontinuous Derivatives 


In the preceding construction of the solution it is clear that the 
characteristics carry information from the boundaries into the region 
concerned. Physically, the characteristics correspond to waves propagating 
with the velocities c,. In a general way, one expects from the construction 
that any abrupt change in the data on a boundary will produce corre- 
sponding abrupt changes propagating on the characteristics through those 
boundary points. If the abrupt change is taken to be a discontinuity in 
some of the derivatives of u, the vague idea becomes precise and the 
expectation is that discontinuities propagate along characteristics. The 
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appropriate results can be shown directly from the equations. The argu- 
ments will be given for simple jump discontinuities in the first derivatives 
of the u, Higher derivatives and other singularities can be handled simi- 
larly. 

Let &(x,t)=0 be a smooth curve separating two regions in each of 
which u is continuously differentiable. Suppose that the u,; are continuous 
as €>0=+ and that du; / dt, Ou, / 0x have finite limits as 0+. If &x,t) isa 
sufficiently smooth function, we can introduce a new local coordinate 
system (x,t), ηί(α, ἢ), and write (5.1) as 

ou, 


Ou, 
(Αγξ, + 58, ) τς + (Ayn + ayng) 3 + 6 =0. (5.17) 


These equations hold in each of the regions €>0 and €<9. By hypothesis, 


hence 
du.(O+,7 du,(O—,n 
CGO) 220s): (5.19) 
θη θη 


This means that the tangential derivatives are continuous across =0, and 
only the normal derivatives related to du,/d€ may jump. The limits of 
(5.17) are finite as €>0+, and the coefficients are all continuous. There- 
fore taking the difference of the limits on the two sides, we have 


el 
(ἀνθ τα) τὲ =0, (5.20) 


Ou; 7 Ou; du, 
τε τίσ) Ge), 


Accordingly, the jumps [du,/0£] are zero unless 


where 


|4,,€, + a,&,|=0 (5.21) 


on €=0. If the curve &x,t)=0 is described in the alternative form 
x= X(m), t= T(m), then 


(ξ,,ξ.) α {Χ΄ (), -- T’(n)}. 
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Hence (5.21) is the same as (5.6), and we have the result that discontinuities 
in the first derivatives of u can occur only on characteristics. 

According to this, propagating discontinuities are ruled out if the 
system has no characteristics, and in such a case any discontinuity in the 
boundary data will be immediately resolved in the solution. On the other 
hand, the existence of characteristics is not a sufficient guarantee that 
discontinuities can occur. The equations provide further restrictions on the 
[du,/0é] and if the system is not fully hyperbolic, these may be stringent 
enough to require [du,/d£]=0. If the system is hyperbolic, however, the 
additional relations do not exclude discontinuities; they provide instead 
equations that determine the variations of the magnitudes of the discon- 
tinuities as they propagate along the characteristics. 

If (5.21) is satisfied and a particular characteristic is chosen, the 
equations in (5.20) give a number of relations between the quantities 
[du,/0€] on that characteristic. The number will depend on the rank of the 
coefficient matrix in (5.20). The simplest case is when there are n—1 
relations, so that all the jumps [ du, /d£] are determined in terms of one of 
them, or, more symmetrically, 


of | ag 5.22 
δὲ σ 7» ( ᾿ ) 
where L is any nontrivial solution of 
(A,§ +4, 6, )L,=0, (5.23) 


and o is undetermined at this stage. If the rank of the matrix is r, there will 
be n—r independent solutions of (5.23) and a corresponding number of 
terms in (5.22) with n—r parameters o. 

Additional information may be obtained by taking the € derivative of 
(5.17) and considering the difference of the limits as 30+. The result 


takes the general form 
d | 9} | ou, 
Ἐ64Ὶ —|)— || --2[|]-Ἦ 5.24 


where E, is linear in the first argument and at most quadratic in the 
second. Although in the main, these equations give information on the 
jumps of the second derivatives [0*u,/d£7] across £=0, the fact that the 
matrix 


ὃ *u 


J 


d&? 


(4γξ, Ἔ ag, ) | 


Ag, + ay, 
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is singular means that certain linear combinations of the E, vanish. These 
provide further relations to be satisfied by the [du,/d€]. The number of 
them will be n—r (where r is again the rank of the matrix), and this is just 
the degree of arbitrariness remaining after solving (5.20). These relations, 
then, provide equations for the o’s introduced in (5.22). 

The details become fairly complicated so we pursue them further only 
for the case of discontinuities at a wavefront propagating into a region of 
constant uniform state. This includes all the important features and is, in 
any case, the main application of the discontinuity analysis. We shall also 
take the system to be in the reduced form (5.9). 


5.6 Expansion Near a Wavefront 


We consider the system (5.9) in the case that it admits constant 
solutions u,=u{. This requires b to be independent of x,t, and requires 
uw to satisfy 


b,(u) =0. (5.25) 


For the reduced form (5.9), the characteristics can never be in the direction 
of the x axis, so we may use ὦ itself as a parameter on the wavefront and 
write the equation of the wavefront in the form x= X(t). Instead of 
calculating the limits of derivatives from the equations, it is particularly 
convenient in the wavefront problem to use the equivalent procedure of 
expanding solutions in powers of 


€=x—X(t). 


If the first derivatives are discontinuous the appropriate form is 


u=u, €—>0, | (5.26) 
y= ui + ul? (1+ SUM ()ε.Ὲ...,. ECO. (527) 
Then 
du, Ou, 
Paorne: ae |-- 470. (5.28) 


and the higher derivatives are related to the other coefficients similarly. 
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The power series version is a convenient way of seeing the extension 
to include other singularities. If the mth derivatives are the first ones to be 
discontinuous, the power series beyond u{ start with the terms in ξ΄", and 
one could also include singularities which correspond to expansions in 
fractional powers of || or in log |§|. The questions of convergence are not 
really at issue here; we are using formal power series as a device to 
calculate derivatives which could also be obtained by taking the 
appropriate limits in the equations. 

The coefficients in (5.27) are obtained by substituting the series in 
(5.9) and equating the successive terms in powers of ξ to zero. If the a, are 
functions of x,t, u, they must be expanded in powers of € with coefficients 
depending on ¢. That is, 


dq”? (0) 

— ,(0) yl) _ 
Ω͂,.ΞΞ ay’? + ur? + +ee:, 5.29 
( Ou, K x ( ) 


where the superscript zero means that the arguments of the corresponding 
function are x= Χ(, ¢ and u=u., However, in writing the resulting 
equations for the u{”(t), the superscript zero will be omitted in the 
interests of clarity. From the substitution in (5.9), we have 


aut? — cut? =0, (5.30) 


dus 1) da; 


da, Ob, 
spite) che) ct aa CO) at a οὔ (1) — Δ] 
aus — cus +) a + au, uz +| ax x) fy 0, (5.31) 


and so on, where c denotes X. These correspond of course to (5.20) and 
(5.24). . 
From (5.30), we deduce first that the velocity X=c must satisfy 


la, — c6,| =0, (5.32) 


and the wavefront must be one of the characteristics. If we take the case 
that the rank of the matrix in (5.30) is n—1, we have 


us =oL,, (5.33) 


where L, is any nontrivial solution of 


(a,—8,)L,=0. (5.34) 


132 HYPERBOLIC SYSTEMS Chap. 5 


There is also a nontrivial eigenvector 1 which satisfies 
I(a,—cd,)=0. (5.35) 


[It is this left eigenvector that arises in the characteristic form (5.10).] If 
this is applied to (5.31), the terms in u are eliminated and we have 


du? da; da, db. 
pao ED pees (1) C1) _ t (1) = 
ἰ Ξ +, 5a, un? +l, > tT ae us’ Ξ 0. | (5.36) 


Finally, substituting the expressions for μι") from (5.33), we have an 
equation of the form | 


iL, + Qo? + Po=0, (5.37) 
where /.L,, Q, and P are all known functions of ¢. 
For hyperbolic systems it may be shown that /.L,0. In other cases, 
however, one might have /.L,=0, Q@=0, P#0, and have to conclude that 
o =(Q; that is, discontinuities are impossible. For example, the sytem 


u,—v=0, 
v, — u, =0, 


which is equivalent to the heat equation with z and x interchanged to fit 
the canonical form (5.9), has the curves x=constant as a double 
characteristic. But 1=(1,0), L=(0, 1), Ο ΞΞ0, P= — 1, so that discontinuities 
are impossible. 

For hyperbolic systems, (5.37) may be reduced to 

do 2 ᾿ | 

— + go°+po=0. (5.38) 
dt 
This is a Riccati equation which can be solved explicity to find the 
variations of o [and hence the ui] along the wavefront. 

If the original system is linear, the a, are independent of u and the 
quadratic term is absent. The solution is 


o(t)=a(0Je™, — p(t) = f p(e)ar’ (5.39) 
0 


where o((Q) is determined by the initial conditions. We observe, in particu- 
lar, that discontinuities can only appear in the solution as a consequence of 
corresponding discontinuities in the boundary or initial conditions. 


Sec 5.6 EXPANSION NEAR A WAVEFRONT 133 


Moreover, once introduced, they can not disappear in a finite time. 
For nonlinear systems with g#0 in (5.38), the equation may be 


rewritten as 
d{l p = 
dt ( σ ) σ "πὸ 


1 efi) 


σ᾽ (0) 


and the solution is 


t , 
+ ented f q(t'je~P dt’, (5.40) 
0 


Again, discontinuities once generated cannot disappear at a finite time; 
they may decay to zero strength as too. However, a new possibility in the 
nonlinear case is that o-oo for finite t. This occurrence will depend on the 
signs of p(¢),q(z), and on the magnitude of o(0). Suppose, for instance, that 
the equation reads 


— = po" — po, (5.41) 


where μι», are positive constants and 0(0)=0,>0. If o,<p/v, the right 
hand side of (5.41) is negative initially so that o starts to decrease. But then 
the right hand side remains negative and so o continues to decrease. 
Ultimately, o-—0 like e™“ as too. However, if o> /v, the reverse is 
true and o continually increases. Eventually, the term vo? dominates and 
leads to o-00 in finite time. The explicit solution is 


So 


δ nee 
o(t)= ν σρ- (σ»-- μ))»)6"΄ 


(5.42) 


If o9—p/v >0, 0-00 as 


do 


— pf 


This predicts the nonlinear breaking of the wavefront, and after this 
time a shock wave, with discontinuities in the functions u, themselves, must 
be introduced. Although this breaking discussion and a criterion such as 
(5.43) is limited to the special form of wave with a discontinuous deriva- 
tive, it is extremely valuable because it is always possible to carry out this 
calculation explicitly. The functions p(t), g(t) appearing in (5.38) depend 
only on the coefficients a,,b,. It is not necessary to find the solution in a 
whole region of the (x,t) plane before the information can be obtained. A 


(> koe (5.43) 
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continuous profile does not have to behave in precisely the same way, but 
one gets a rough estimate of the magnitude of the derivatives required to 
produce a breaking wave and an estimate of the time of breaking. In 
general, it may not be possible to find explicit solutions for continuous 
profiles to determine the criteria exactly for those cases. 


5.7 An Example from River Flow 


As an interesting application of the wavefront expansion, we consider 
the river flow equations discussed in Section 3.2. They are 


h, + vh,, + hv, =9, 


Ε. (5.44) 
v, t+ v0, + Ph, = 5΄ 5 -- OF 


The uniform flow has constant values h=ho, v=o, with ρ΄ δ᾽ = Cyv9"/ho, 


and the wavefront has constant speed in this case so we take £=x— ct. 
Behind the wavefront the flow variables are expanded as 


h=ho+ h\(t) + 5 §7h,(t) + nak 
v= ogo, (1) + 5 £709(t) τ τ, 


These are substituted in (5.44) and the terms in successive powers of é give 


(v9—c)A, +hov, =9, 


(5.45) 

gh, +(vo—c)v, =0; 
dv 20 h (550) 

/ 1 2 t 1 1 
=a 552... + i eee ee 6 
gh,+(vy—c)v,+ Fi + 0; rs τ τ 0; 
and so on. From the first two we have 

(c— 09) =8' Ig (5.47) 


so the propagation speed is 


C=Upt Va'hy 5 (5.48) 
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we also have 


2 (c—vp)h, 


i, (5.49) 


0 


The relation (5.47) allows us to eliminate h, and v, from (5.46). The 
resulting equation is 


ἘΠΕ MEP h, + ( ) ail 24 (2 0 
8΄. τ +2g'v C— 9) { = ἜΥ1Ὲ 5΄ 51] --- -- -- [)5Ξ0. 
dt 2} ὦ ges Pee ho 


Finally, eliminating v, by use of (5.49), we have 


dh, 3 


h? S 309 \ h, 


2 | ἢ 


The quantity h,(2) is the value of the derivative h, at the wavefront. 
We now consider various special cases. 


Shallow Water Waves. 


In the usual shallow water theory the slope and friction terms are 
absent from (5.44); (5.50) becomes 


dh, _ 3 h? 
dt No 


Downstream waves, with c=v)+ V gh, , break if h, <0; upstream waves, 
with c=v9— V ghy , break if h, >0. 


Flood Waves. 


For flood waves going downstream, c=v)+ V g’h, and (5.50) reads 
ὕρ 


Sas 11}... a |e 
at 2 ho Uo | 2. /2'Ng 


If vo/ Vg'ho >2, the linear term indicates an exponential increase for h, 
of either sign. This corresponds to the instability of the steady flow under 
these conditions and checks with the result deduced from (3.41). If 


(5.51) 
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Uo/ V g’hy <2, the linear term shows exponential decrease corresponding 
to stability. However, if h,(0)<0 and 


ge | 1 ὗῦρ 


h a εῦβορε = de 
μιι(0}}» 3 τ 


dh, /dt<0 and h,—>-— οὐ in a finite time. This corresponds to nonlinear 
breaking at the wavefront, and a bore will be formed at the head of the 
wave. This checks with the analysis in Section 3.2 where it was shown that 
a sufficiently strong flood wave would be headed by a bore. 


Tidal Bores. 


For a wave propagating upstream, c= tv )— V g’hy , and (5.50) reduces 
to 


dh, 34/2 . 8.5 1 % 
ds No ἐπὰν 1+ -- h,. 


A wave with positive h, will break if 


2 SV g'ho 1 ὔῦρ 
Sa τ; . (5.52) 
Up \ /9'No 


h,(0) > 


The existence of a minimum value of h,(0) which must be exceeded 
for bore formation is particularly interesting in view of the well-known 
observation that only relatively few rivers, with sufficiently high tidal 
variation at their mouths, develop tidal bores. The analysis here is limited 
to a wavefront, whereas the appropriate case for tidal bores would be an 
initially smooth sinusoidal variation. However, it has the usual virtues of 
analytic results; one can see the dependence on the various parameters 
explicitly, one can predict the asymptotic behavior for large values of x 
and t, and so on! The continuous case, which cannot be solved analyti- 
cally, might require extensive numerical computations to establish clear 
criteria. Thus the present approach provides valuable estimates. In fact, 
Abbott (1956), who used this type of analysis and applied it in detail to the 
river Severn, found remarkably good agreement with observations. (Actu- 
ally, Abbott developed his work in terms of high frequency approxima- 
tions, but the two approaches are mathematically equivalent. Moreover, 
there is no gain in justification; the tidal variation is “low frequency” and 
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one has to argue that only high frequency effects contribute toward 
breaking.) 

It is important, however, to extend (5.52) to include nonuniform flow 
and topography in the undisturbed state of the river. The narrowing of the 
river upstream is particularly crucial in making the actual estimates, since 
it favors breaking and is needed to offset the usually overpowering damp- 
ing of the frictional forces. The details can be found in Abbott’s paper. 

To apply the wavefront results we use the maximum rate of change in 
the tidal variation to determine h,(0). If the tidal variation at the mouth of 
the river is 


h=h)+asinot, 
the maximum value of h, is aw. In the discontinuity analysis the initial 


value of h, at the wavefront is ( Ve'hy — 09) A,(0). Therefore we choose 


aw 


0) = 


V 5΄ἢρ — Up 


For the uniform channel, (5.52) would predict bore formation if 


aw > το ν΄ ἢο -}( V σ΄ +509). 


2 


The formula can be written in various forms using g’S = Cyvg/ ho, and the 
least sensitive is probably 


τ» FG 1-—— |(+ 3 = ) (5.53) 
ν oho V EN 


For rivers to/ Vg’hy is fairly small, so the right hand side can be 
approximated by the first factor. For the typical values of vy=5 ft/sec, 
C= 0.006, this leads to unattainable values of the order of 100 ft for a. The 
effects of narrowing and other factors bring the value down considerably, 
although exceptionally high tides and rapidly changing topography are 
necessary. That is why very few rivers have bores. For the river Severn, 
Abbott finds that with all nonuniform effects included the required value 
of the tidal range 2a is 39.4 ft. The spring tides have an average range of 
41.4 ft while the neap tides have an average range of 22.2 ft. Thus Abbott 
predicts that bore formation should occur for about 4 days around the 
times of the highest tides. This appears to be borne out by observations. 
His predictions of the distance upstream at which the bore forms and of its 
maximum height appear to be in fair agreement. 
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5.8 Shock Waves 


The situation as regards the breaking of waves and the introduction of 
shock waves is very much the same as in the case of a single quasi-linear 
equation. Some solutions which are initially single valued, and even 
continuous, will develop multivalued regions: waves will break. This is 
again interpreted as an inadequacy of the assumptions leading to (5.1), but 
the appropriate saving features can be well approximated by allowing 
discontinuities in u. 

We again take the view that in formulating the differential equations, 
there will have been an earlier stage where the equations were in integrated 
form 


£ (hax+lailet [hidx=0, (5.54) 
xX X> 


where f,, g,, ἢ, are various quantities of physical interest in the problem. 
For example, in problems of mechanics f; and g, could be the density and 
flux of mass, or the density and flux of momentum, or the density and flux 
of energy. The quantity h, allows for a distributed source term, such as a 
body force in the momentum equation. Equation 5.54 is a conservation 
equation for the physical quantity concerned (mass, momentum, energy, 
etc.). 

The densities f, will be functions of (x,/) and of n basic variables 
u=(u,,...,u,); in general, there will be n equations (5.54) in the statement 
of the appropriate physical laws. Various simplifying assumptions will then 
be made to relate the g,, h, to x, t, u. At the first level of approximation, the 
g, and h, will just be functions of x, ¢, u. If u has continuous first 
derivatives, (5.54) may then be written in the differentiated form 


Of;(x,t,u) ᾿ dg,( x,t, u) 


7 a thy x,t,u) =0. (5.55) 


This is a differential equation in conservation form. 

If discontinuities in u are to be included, the integrated form (5.54) 
must be used and the dependence of g, and ἢ; on u left open at first. If a 
discontinuous shock occurs at x = s(t), exactly the same argument given in 
Section 2.3 gives the shock conditions 


-U[f]+[gl=0, [Ξ1,...,5, (5.56) 


where U is the shock velocity s(t). We then argue that in the continuous 
parts of the solution on the two sides of the shock, it is still a good 
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approximation to take 
2,=28,(x,t,u), h,=h,(x,t,u). 


Therefore (5.56) is applied with the same functional dependence of g, on u. 
As in Chapter 2, a more accurate choice of the g, will involve derivatives of 
u, and the shocks will be smoothed out into thin regions of rapid change. 
However, the discontinuity treatment is simpler and usually is adequate. 

The formal mathematical definition of weak solutions of (5.55), leading 
to the jump conditions (5.56), follows closely the discussion in Section 2.7. 
Evaluating the derivatives in (5.55), we see that it is a case of the system 
(5.1) in which 


of; 0g; 
aia “i Bu," (5.57) 


The discussion of weak solutions is applicable only to these special cases. 
Moreover the important warning about nonuniqueness must be empha- 
sized. In typical cases, starting from the relevant system (5.1), it will be 
possible to find more than n different equations in the conservation form 
(5.55). Shock conditions (5.56) from a choice of any n of them will be 
satisfactory mathematically, but only those n equations that correspond to 
the original physical statements in (5.54) will give the correct solutions for 
the problem. A good example of this nonuniqueness occurs in gas dyna- 
mics (see Chapter 6). In view of the nonuniqueness, the connection with 
the physical laws is stressed here. 


5.9 Systems with More Than Two Independent Variables 


We comment briefly on the situation for quasi-linear equations with m 
independent variables where m>2. The system may be written 


ὃ 
αὐτοῖς + b,=0, i=1,...,n, (5.58) 


where the dependent variables u, are functions of the m independent 
variables x!,x?,...,x, and the summation extends over y= 1,...,m, as well 
as j=1,...,n. The analogs of the characteristic curves for m=2 are 
characteristic surfaces in the m—1 dimensional x space. They may be 
introduced somewhat as before and some of their properties are similar. 
However, they are very much more limited in their usefulness for con- 


structing solutions. 


140 HYPERBOLIC SYSTEMS Chap. 5 


The limitation arises because it would be too much to expect, in 
general, that one could find linear combinations of the equations in (5.58) 
such that the directional derivatives of each u, have the same direction. 
The restrictions on the system or on the solutions would make them far too 
special. So the analog in this case has to be that the directions involved 
should lie in an m—1 dimensional surface element. If the appropriate 
linear combination is 


du, 
Lay — +1.b,=0, (5.59) 


the directional derivative for u, is aj. If the surface element belongs to a 


surface S(x)=constant, the normal vector is dS /dx”, and the orthogonal- 
ity requirement is 


aS Ν 
layne =0, 751,, πὶ (5.60) 


The condition for | to be nontrivial is that the determinant 


Hee be (5.61) 


Surfaces with this property are characteristic surfaces. Again the system will 
be hyperbolic if n independent equations of the form (5.59) can be found 
with this property. Usually these will correspond to μΜ different 
characteristic surfaces, but this is not necessary provided that the full 
complement of n vectors | can be found. However, these choices do not 
simplify the solution to the same extent as in the case m=2, since we are 
still left with m—1 coupled directions within each surface. 

In view of this, the main property of characteristic surfaces for wave 
propagation problems is that they carry singularities in the solution and, in 
particular, describe wavefronts. In close analogy with the approach in 
Section 5.5, let S(x)=0 be a surface across which the u, are continuous but 
the du,/dx” are allowed to have simple jump discontinuities. From the 
continuity of the u,, all the tangential derivatives must be continuous; 
hence only the normal derivatives can be discontinuous. If the surface 
S=0 is imbedded in a family of surfaces S=constant, so that S can be 
used as a local coordinate supplemented by any choice of the other n—1 
coordinates, the discontinuous derivatives are du,/0S. Then closely follow- 
ing the argument for m=2, we deduce that 


7 OS Ee - "- =0. (5.62) 


ἀν 9x" | OS 
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Therefore discontinuities can only occur on surfaces which satisfy 


» oS 


9% Ox” 


--0. (5.63) 


This is the same as (5.61) and defines the characteristic surfaces. Equations 
5.62 and 5.63 are the generalizations of (5.20) and (5.21). As before, further 
relations can then be derived for the jumps [du,/ dS]. 


5.10 Second Order Equations 


Second order linear equations taking the form 
9΄φ ὃφ 


Α 
appear frequently, and even in the case of two independent variables it is 
usually more convenient to leave them in this form than to work with a 
first order system. Indeed, we saw an indication in Section 5.2 that there 
may be some problem in finding a satisfactory equivalent system, unless of 
course (5.64) was derived from one. 

There are many approaches to the classification of (5.64). In the 
context of wave propagation, the possibility of wavefronts carrying discon- 
tinuities in derivatives is an important question, and it provides the 
simplest link to show consistency with the discussion of first order systems. 
Obviously, in those cases where (5.64) does come from a reasonable 
equivalent system the definitions of hyperbolicity should agree. A detailed 
proof of consistency is not attempted here, but the choice of this approach 
shows the close connection. 

Consider, then, the possibility of jump discontinuities in the second 
derivatives of q. If these occur across a surface S(x)=0, while q and 
ὃφ 7) 0x; Temain continuous, we may introduce local coordinates based on 
S(x)=0 as before and deduce that ὃ ῳ͵,9.5 2 is discontinuous but the other 
second order derivatives remain continuous. Then, taking the difference of 
the limits of (5.64) on the two sides of S=0, we have 


as as | ὃ΄φ 
1 Ox; | τε τὸ (5.65) 

A necessary condition for [8 ᾿φ,.8.5 2950 is 
ΓΦ (5.66) 
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It can be shown that discontinuities in the first derivatives or even in » 
itself (since the equation is linear) must also be restricted to such surfaces. 
But the arguments require more careful discussion, including the question 
of what exactly is meant by solution, and they are postponed until they are 
needed in Section 7.7. 

The classification now depends on the quadratic form A,&&. At any 
point x, it may be reduced by means of a linear transformation to the form 


aézt+--- +a,€2. (5.67) 


If all the a, are the same sign, there is clearly no solution of (5.66); the 
equation is elliptic at such a point. If some of the a, are zero, it is parabolic; 
in the usual case, one of the a, is zero and the remainder have the same 
sign. If the a, are nonzero, but not all of the same sign, we have the 
hyperbolic case. In applications it appears that the only hyperbolic cases 
that arise have m—1 of the a, with the same sign and just one with the 
opposite sign. An explanation of this is that surfaces described by (5.66) 
have peculiar geometrical properties otherwise, and, for example, could not 
represent the simple intuitive picture of an expanding wavefront. Accor- 
dingly the term hyperbolic is restricted to this case. 

To free the classification from its reliance on the discontinuity analy- 
sis, one merely notes that the linear transformation required to reduce the 
quadratic form A,&& to (5.67) may also be used to generate a local 
coordinate transformation which reduces the leading term in (5.64) to the 
form 


2 
ὌΝ 
ΙΧ ax2 ” OX? 

This is independent of any question of discontinuities and the classifica- 
tion proceeds as before on the’signs of the a, in the leading term. 

The discussion here has necessarily been kept brief on those questions 
which will not arise directly in the later work. Further reference may be 
made to the many excellent texts on the general theory of partial differen- 
tial equations, such as Courant and Hilbert (1962) or Petrovsky (1954). 


CHAPTER 6 


Gas Dynamics 


As explained in Chapter 1, many of the basic ideas for hyperbolic 
waves and, particularly, the elucidation of shock phenomena came from 
gas dynamics. This chapter is a discussion of waves and shocks in gas 
dynamics. It provides a natural illustration of the general ideas developed 
in the last chapter and adds the sort of amplification and extension that 
can only be shown on specific problems. But gas dynamics is of course an 
important and interesting subject for its own sake, so this chapter is 
presented as a thorough introduction, not just as an illustration of the 
mathematical theory. Further specialized topics are taken up in later 
chapters, and all the material combined provides a broad coverage of gas 
dynamics. However, the reader interested only in the general development 
of wave theory may skim this chapter. 


6.1 Equations of Motion 


The equations of motion for a compressible fluid are derived by 
writing the equations of conservation of mass, momentum, and energy for 
an arbitrary volume of the fluid. Each of these brings in the corresponding 
variables to describe the balance. The description of mass flow requires 
two quantities: the density p(x,7) and the velocity vector u(x, ἢ) at any point 
x at time ¢. Momentum requires additional quantities to describe the forces 
acting on the fluid. There may be a body force, usually gravity, acting on 
all the fluid in any volume. This is denoted by a vector F(x,?¢) per unit 
mass, and for gravity would be g times the unit vector in the vertical 
direction. There are also stresses acting across the boundary of any volume 
of fluid. The stress acting on any small element of area of the boundary 
surface is taken to be proportional to that area. In general, it depends also 
on the orientation of that surface element. Therefore the force per unit 
area will be a function of position x, time ¢, and the unit vector 1 normal to 
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the surface element. A standard argument, which will be detailed later, 
shows that the ith component p, of the stress may be written as a 


DP; =P,ilj (summed over /), (6.1) 


where the quantities p,(x, ἢ) depend only on the position x and time ἡ. Since 
p; and ἢ, are vectors, the components p,, form a tensor, and it is referred to 
as the stress tensor at (Χ, ἢ). The component p,; is the ith component of the 
force per unit area on an element of area whose normal is in the /th 
direction. 

The energy equation introduces still further quantities. The fluid has 
internal energy due to the thermal agitation of the molecules. In the 
continuum theory it is specified as a quantity e(x,7f) per unit mass. There is 
also heat flow across the boundary and this will be denoted by a vector 
q(x, ἢ) per unit surface area. 

We are now in a position to write down the conservation equations, 
although they will not yet provide a complete system since there are more 
unknowns than equations. We consider a fixed arbitrary volume V of the 
region occupied by fluid and write down the net balance for that region, 
bearing in mind the transport of the fluid across the boundary surface S. 
For the conservation of mass the rate of change of the total mass in V, 


J pav, 


is balanced by the flow across S. If I denotes the outward normal to S, the 
normal component of velocity across S is /ju,. Hence 


d 
a) pay [ehujds=0. (6.2) 


In similar fashion, the equation for the net balance of the ith com- 
ponent of momentum is 


d 
| pu,dV + | (pu,lu,—p,) dS= | pF,dV. 6.3 
ay J puaV + | (oulay—pas= [ (6.3) 


The first term is the rate of change of momentum inside V, the second 
term is the transport of momentum carried across the boundary by the 
flow, the third term is the rate of change of momentum produced by the 
stress p, acting across the surface S, and the right hand side is the 
momentum created inside V by the body forces. 

The total energy density per unit volume consists of the kinetic energy 
ou? of the macroscopic motion plus the internal energy pe of the molecu- 
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lar motion. For energy balance we have 
d ] 2 pee 5.5 
a ᾿ 5 Pui + pe} dV + [{ (pu? + pe) lu; — pu; + 1q;} dS = J prima. 


(6.4) 


The first term in the surface integral is again the contribution from bodily 
transport across the boundary, the second term is the rate of working by 
the stress p, at the boundary, and the third term is the loss or gain of heat 
by conduction across the boundary. The right hand side is the rate of 
working by the body forces. 


If discontinuities are allowed in the flow quantities, these integral 
forms will be needed. For one dimensional problems, the volume integrals 
become integrals over x, from x, to x,, say, and the surface integrals 
reduce to the differences of the integrands at x, and x,; they take the form 
quoted in (5.54) for the treatment of shock waves. In large regions of the 
fluid, however, the quantities will be continuously differentiable and we 
may take the limit as the volume V shrinks to zero in order to obtain the 
corresponding differential equations. In (6.2)(6.4) the time derivatives 
may be taken inside the volume integrals, since V is independent of t, and 
the surface integrals can be converted into volume integrals using the 
divergence theorem: 


| dv, 
[ jeds= ἢ or dV, (6.5) 
V 


for any continuously differentiable vector v, and any reasonably smooth Κ΄. 
Thus (6.2) may be rewritten as 


[ ΞΡ + τε (pu,) dV =0. (6.6) 
V . 


Since the integrand is continuous and (6.6) is true for arbitrarily small V, 
we deduce that 


0 
at ag δὰ (6.7) 


[If it were nonzero at any point, it would have the same sign in some small 
volume V, by continuity, and (6.6) would be violated.] If (6.1) is accepted 
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for the moment, (6.3) and (6.4) lead in a similar way to 


0 0 = 
ὃ, (pu;) + ax, (pu,u; — Ρμ) = pF;, (6.8) 


and 


l 0 ] 
ir zen t0e) + ay {( zou? tee)y—nmtg} ok. (69) 
J 


The usual argument to establish (6.1) is in fact a first approximation 
to (6.3). If the maximum dimension of V is d, then the volume V is O(d?). 
Any volume integral with continuous integrand is then O(d*), by the mean 
value theorem. The first surface integral in (6.3) is equal to the correspond- 
ing volume integral by the divergence theorem (6.5) and hence is also 
O(d*). Therefore (6.3) shows that 


Jp.as= O(d?) (6.10) 


for all S. The relation given in (6.1) is clearly sufficient, since the diver- 
gence theorem may then be used to show that (6.10) is satisfied. To show 
that it is also necessary, we first define quantities p,; for j=1,2,3, as the 
values of p; when the area element is perpendicular to the x,, x,, and x, 
axes, respectively. We then apply (6.10) to the special case of a small 
tetrahedron with three faces normal to the three coordinates axes. If the 
fourth face has unit normal I and area AS, the areas of the other three 
faces are the projections /,AS,/,AS,/,AS. Then (6.10) shows that 


PADAS = p, 1, AS + p2,1AS + p3,1,AS + O(d?), 


where p,(1) and the p,, are evaluated at appropriate mean value points in the 
corresponding faces. In the limit, as d—>0, we have 


PAY) = Pj!) + Po! + p3;/3, 


in agreement with (6.1). This is a rather inelegant proof but apparently 
there is no way to change it essentially. 

In connection with conservation equations, it is natural to ask whether 
the conservation of angular momentum adds anything new. For the x, 
component of angular momentum, we would have 


2 0 
py (Χ1ρ1.2 — X,pu,)+ “Ox, { (x pu, — x2pu,)U;— (2% Pj— X2Pj1)} 
Jj 
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with similar expressions for the other components. When (6.8) is substi- 
tuted in (6.11) most of the terms cancel and we are left with 


P\2> P21: 
Thus angular momentum leads to the symmetry of the stress tensor: 
Pa = Py: (6.12) 


It is valuable information, but it is a subsidiary equation compared with 
the others. 

Equations 6.7-6.9 provide five equations for the fourteen quantities 
0, U;, jis 4 6: To complete the system various additional relations are posed 
between the flow variables. 


6.2 The Kinetic Theory View 


It adds to the understanding of the various terms in the conservation 
equations 6.7-6.9 to note what they represent from a molecular viewpoint. 
The molecules have a whole distribution of velocities, and the flow 
quantities are related to the distribution function f(x, v, ἢ), which is defined 
so that 


f(x, v, ἢ dx, dx, dx, dv, dv, dv, 


is the probable number of molecules in a volume element dx, dx,dx, 
centered at x, in a velocity range dv, dv,dv, centered at v. Then the density 
and macroscopic velocity u are defined by 


οΟ οΟ 
p= [ mf av, pu; = { mo,f dv, (6.13) 
— © πο 
where m is the mass of a molecule and {dv denotes the triple integral over 


all values of v,,0,,03. The total flux of the ith component of momentum 
across a surface with normal I is 


f ” mov) fav. (6.14) 


If we set v=u+e, so that c measures the difference of the molecular 
velocity from the mean u defined in (6.13), this may be expanded to 


[ἢ mfde+ uf me, [ας Ὁ uf mc, fde+ [ πο Τάς) 
oO ~ © — ce — 00 
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From the definition of ¢ as the deviation from the mean, the middle terms 
vanish and we have , 


i{ou, u; + fo mer [ dc) (6.15) 


In a perfect gas where intermolecular forces are limited to relatively 
instantaneous “collisions” between the molecules, this is the only contribu- 
tion to the surface integral in (6.3) and we see that 


οο 
—p;= if mc,c,f de. (6.16) 
— & 
This agrees with the form in (6.1) and shows that the stress tensor is 
ioe) 
— p= f mec, fde; (6.17) 
— οὐ 


the symmetry (6.12) is immediate. Thus the stress contribution in (6.3) may 
be interpreted as the additional momentum flux by the motion of the 
molecules relative to the mean. 

Each molecule has translational kinetic energy 4mv?. The molecules 
may also have vibrational or rotational energy but, for the present, we take 
the case of a monatomic gas for which these additional forms of energy 
are absent. The total energy per unit volume is then 


Ἰὼ I “1 
=~ ΟΣ ὧν = — ou2 —mce2f de. 
J zmeitan peur f  ymeifde (6.18) 


Therefore the internal energy term in the volume integral in (6.4) may be 
interpretated as the additional energy of the molecular motion relative to 
the mean, and we have 


οο 1 2 
pe= [ 3 me; fde. (6.19) 


The energy flux across an element of surface with normal ] is 


“΄.- 
᾿ς πιο} 04}. 
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In terms of the quantities already defined, this can be broken down into 
I(pu7u, + peu, — p,u; + q;)s (6.20) 
where 


0 | 
45 [ 5 mejc,f de. (6.21) 


By comparing with (6.4), we see that (6.20) agrees with the energy flux in 
(6.4), and the heat conduction q is interpreted as the transfer of excess 
molecular energy by molecular motion. 

Even for the discussion of an ideal gas, it is important to include the 
vibrational rotational energy possessed by diatomic and more complicated 
molecules. This energy should be added to the expression in (6.19) and 
there will be a corresponding contribution in the heat flux vector (6.21). A 
basic result in statistical mechanics is that the different forms of energy 
reach an equilibrium value with equal contributions from each degree of 
freedom. This will allow us to generalize (6.19), when necessary, without 
going into details. 

These interpretations of stress, internal energy, and heat conduction in 
terms of the random molecular motion show that the various quantities 
introduced in (6.7)(6.9) are not just a rather ad hoc choice representing 
any important effects we can think of, but they follow a consistent scheme 
using higher and higher moments of the velocity distribution f. Indeed in 
kinetic theory proper, a basic equation is proposed for f and then the 
conservation equations (6.7)-(6.9) are deduced as consequences of it. The 
equation for f is usually taken as the Boltzmann equation or some 
approximation to it. 

We return now to the continuum equations. 


6.3 Equations Neglecting Viscosity, Heat Conduction, and Relaxation 
Effects 


For a gas in equilibrium in the absence of body forces, we have the 
following: 


1. The stress on any element of area is normal to the area and 
independent of its orientation. Hence 


Pj = — PS ip (6.22) 
where p is the scalar pressure. 
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2. The heat conduction 


q,=9. (6.23) 


3. The internal energy is a definite function 
e=e(p,p) (6.24) 


of the pressure and density. The form of this function is established by 
experiment and various thermodynamic arguments. 

When the gas is nonuniform and in motion, none of these is strictly 
true. However, provided time and space derivatives are not too large, they 
are still good approximations for many purposes. With them the basic 
conservation equations become a complete set for the five flow’ quantities 
p,p,u;. They are 


dp 8 
Bt t Ox, eum O (6.25) 
d(pu;) 8 Op 
πὰ ax, ity) + = pF; (6.26) 
<5 u? + pe) + ὃ (5-00? ρθε) = ρ ἔμ, (6.27) 
Ot 2 p i Ox, 2 ri J 11" e 


When these equations predict shocks or other regions of high gradients the 
assumptions may need to be improved. 

The first assumption, (6.22), corresponds to neglecting viscous effects 
and would be improved in the Navier-Stokes approximation by adding 
terms linear in the velocity gradients du,/dx,;. The second assumption, 
(6.23), neglects heat conduction and would be improved by taking q 
proportional to the temperature gradient. The appropriate forms for the 
Navier-Stokes equations are 


ἘΝ ΓΗ" ou δ, Ἐ ΒΟ 6.28 
Pyi~ ~ PO; ar Ox, i μ ax, ax, ᾿ ( : ) 
me ΔΤ 
q;=—% ae, (6.29) 


where p» and A are the coefficients of viscosity and heat conduction, 
respectively. The temperature 7 15 related to p and p by the equation of 
state of the gas. 
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The third assumption in (6.24) assumes that the gas is in local 
thermodynamic equilibrium. In changing flows the internal energy is 
always tending toward the equilibrium for the new conditions, but there is 
a time lag, particularly in the adjustment of the rotational and vibrational 
energy. This is a so-called relaxation effect and the typical time lag is 
referred to as the relaxation time. This is an interesting but rather special 
topic so the details are postponed and given as an example in Chapter 10. 


6.4 Thermodynamic Relations 


We could just take the view that e(p,p) in (6.24) is some empirical 
function that is given to us. However, the arguments developed in 
thermodynamics not only provide us with formulas but suggest the impor- 
tant quantities to consider. It is appropriate to note here only the 
mathematical steps we require, and refer the reader to the numerous 
standard texts for motivation and a study of the deeper significance of the 
issues. 

The differential form 


de+ p a{+) (6.30) 


plays a fundamental role. It arises first in considering the consequences 
when a small amount of energy is added to unit mass of the gas. If the 
energy is added relatively slowly so that there is no violent change in the 
pressure, the work done in expanding the volume 1/p by d(1/p) is 
pd1/p). The rest of the energy must go into increasing the internal energy 
by de. In these circumstances (6.30) is equal to the amount of energy 
added. But in any event, for given e(p,p) it is a differential form in two 
variables p,p. By Pfaff’s theorem, this always has an integrating factor, so 
that there exist functions 7(p,p) and S(p,p) such that 


T ds = de+ pd{ + | (6.31) 

(— This simple mathematical statement acquires its deep significance from the 
fact that T is the absolute temperature and S is the entropy. 

In more complicated systems, other thermodynamic variables (such as 

the concentration of different phases of the substance) appear besides p 

and p. Then the differential form corresponding to (6.30) involves more 

than two variables. On purely mathematical grounds one can no longer 
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claim that there is always an integrating factor to relate it to a perfect 
differential. However, the basis of thermodynamics is that this will always 
be so for all real physical systems, and, moreover, the integrating factor 
will always be the absolute temperature. 

The mathematical step in (6.31) seems to introduce T and S as 
subsidiary derived quantities from a given e(p,p). But they play an equally 
fundamental role, and (6.31) should be viewed more as a relation between 
equally important quantities. 


Ideal Gas. 


Under normal conditions most gases obey the ideal gas law 


p=RopT, (6.32) 
where & is a constant. When that is the case, we can write (6.31) as 


_ # 


dS τα -π 


—d( logp). 
It follows that de/T must be a perfect differential and therefore e is a 
function of T alone: 


e=e(T). (6.33) 


It is interesting that (6.33) can be deduced from the assumption (6.32), but 
actually (6.33) is more fundamental. 

Equations 6.32 and 6.33 describe an ideal gas. In the equations of 
motion it is convenient to express e as a function of p and p. We see that 
for an ideal gas e is a function of p/p. The form of this function could be 
left open, but in fact a rather simple formula covers a wide range of 
phenomena in gas dynamics. It arises in considerations of the specific 
heats. 


Specific Heats. 


When heat is added slowly to unit mass of gas, it may be distributed 
between internal energy and volume change in various ways, provided that 
the sum in (6.30) is equal to the heat added. A specific heat is defined as 
the ratio of the heat added per unit mass to the temperature change. If the 
fluid is kept at a constant volume, the heat added goes entirely into 
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internal energy; hence 
de =c,,aT, (6.34) 


where c, is the specific heat at constant volume. Alternatively, if the 
pressure is kept constant, and the fluid is allowed to expand, we have from 
(6.30) 


P 
{415 )=car, (6.35) 


where c, is the specific heat at constant pressure. The quantity e+p/p 
which appears here and also, significantly, in the flux term in (6.27), is the 
enthalpy 


haete. (6.36) 


From (6.32) and (6.33), we see that for an ideal gas e,h,c,, and Cc, are 
functions of the temperature alone. 


Ideal Gas with Constant Specific Heats. 


It is found empirically, however, that it is a good approximation to 
take the specific heats constant over large ranges of temperature. Hence 


e=c,T, h=c,T. (6.37) 
Since the difference of these is p/p, the gas law (6.32) follows with 
C—O ER. 


If the ratio of the specific heats y=c,/c, is introduced, we have 


C= Cy» R =(y—1)e,, 
(6.38) 
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hence 


S=c, log a + constant, 


or, alternatively, 


p=Kpres/%, (6.39) 
where κ is a constant. 
An ideal gas with constant specific heats is sometimes referred to as a 
polytropic gas. 


Kinetic Theory. 


Some of these relations have a simple kinetic theory interpretation 
which is worth noting. First, the temperature T measures the average 
kinetic energy per molecule in the translational motion of the molecules. It 
is normalized so that this energy is kT, where k is Boltzmann’s constant. 
For an ideal monatomic gas, this is the whole of the internal energy, so 
that 


e= 3 kTn, 
where x is the number of molecules per unit mass. Thus the expression for 
e as a linear function of T is essentially an identity in this case. 
In equilibrium, the expression in (6.17) for the p,, must reduce to 
— p6;;, Therefore the pressure may be related to the molecular motion by 
the formula 


2 
| τριΞ Ak me; f de. 


But the translational energy (6.19) involves one half of the same integral 
and is equal to 4k7Tnp; hence we must have 


p=knoT. (6.40) 


This is the ideal gas law with & =kn. The number of molecules per unit 
mass is Avogadro’s number N divided by the molecular weight of the gas. 
Hence the constant “2 used here is the universal gas constant kN divided 
by the molecular weight of the gas. 

When the molecules have other forms of internal energy, such as 
vibrational or rotational energy, it is a basic principle of kinetic theory 
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that, in equilibrium, the average energy in each degree of freedom is the 
same. The temperature is defined so that the energy per molecule is 4kT 
for each degree of freedom. Thus it is ΞΚΤ for the three translational 
degrees of freedom. When there are a degrees of freedom the average 
energy per molecule is }akT. Therefore the average energy per unit mass is 


ἐξ 5 akTn, (6.41) 


The relation for p is unchanged, since p is related to the translational part 
of the energy. Combining these various results, we have 


e= FaKT, h=(Sa+1)Q7, p=RopvT, (6.42) 


and we deduce that 


= Za, C =(5a+1)%, γειῈ 5. (6.43) 
These agree in form with the expression obtained earlier for an ideal gas 
with constant specific heats, but they have the extra feature that formulas 
for c,, c,, and y are included. 

For a monatomic gas, a=3, y=5/3. For a diatomic gas including two 
rotational degrees of freedom, a=5, y=1.4; this is a good approximation 
for air. 


6.5 Alternative Forms of the Equations of Motion 


The conservation forms of (6.25)-(6.27) correspond to the integrated 
forms (6.2)-(6.4) and will be needed for the treatment of shocks. But for 
other purposes the equations may be simplified. It is convenient to intro- 
duce the operator 


for the time derivative following an individual particle. The mass equation 
(6.25) may be written 


DB EOE 
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from (6.25), the p derivatives can be eliminated in (6.26) to give 


Du; op - 
oe ale ha ad sl 


The energy equation (6.27) can be written in various forms. First, using the 
two other equations, we can reduce it to 


De du; -- 
PH +p ax, =(). (6.46) 


Then, from (6.44), an alternative form is 


T=? Ξῇ (6.47) 


That is to say, the entropy remains constant following a particle. Flows 
satisfying (6.47) are usually called adiabatic. 

It should be stressed that the arguments leading to (6.47) are purely 
mathematical manipulations of the conservation equations. One could, in 
principle, have been led to introduce an “interesting quantity S(p,p)” in 
this way, without any prior knowledge of thermodynamics. It is reassuring 
to have the results here on that basis. The discussions of (6.31) in 
thermodynamics refer to infinitely slow reversible changes, and we might 
appear to be using them unjustifiably outside that context. However, once 
the assumptions that p;, = — pd,;, and e=e(p,p) are adopted, the rest is 
mathematics and consequences such as (6.47) follow without any restric- 
tion to slow flows in the thermodynamic sense. 

Since the expression for S in terms of p and p may be solved in 
principle as p=p(p,S), we may use | 
Dp ,Ὁρ a | ΟΡ 


δ᾿ Dt Δρ 


(6.48) 


S=constant 


as an equivalent form of (6.47). The quantity a will subsequently be 
identified as the sound speed. 

Unless the conservation form of the equation is particularly required, 
it is usual to work with (6.44), (6.45), and either (6.47) or (6.48). It will be 
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convenient to collect them together for future reference: 


pt + ΕΝ =pF;, (6.49) 
For a polytropic gas 
e= =e S= clog, aq? = τ (6.50) 


The entropy equation shows simply that the entropy remains constant 
on each particle path. In general, it may take different values on different 
particle paths. However, if the fluid is initially at rest with uniform entropy 
So, it follows that S= Sp on each particle path and hence remains uniform 
in the motion. Such flows are called isentropic. When this is the case, p is a 
function of p alone, and the equations reduce to the first two in (6.49). For 
a polytropic gas 


p=kKp". 


This argument requires modification when shocks or other discon- 
tinuities are present. The differential equations, and in particular the 
entropy equation, apply only to regions where the functions are differenti- 
able. Across a surface of discontinuity the entropy jumps, and in general 
the amount of the jump will vary with time and position as the surface 
propagates. Thus an initially isentropic flow may not remain so after a 
shock passes through. This will be discussed in detail in Section 6.10. 


6.6 Acoustics 


The first information on wave propagation in gas dynamics is pro- 
vided by the theory of acoustics, which refers to the linearized theory of 
small disturbances about an equilibrium state. The simplest case arises 
when body forces are neglected and the equilibrium state is taken to have 
constant values p=pp, p=Pp, u=O0. If the initial disturbance also has 
uniform entropy, the motion remains isentropic and we may take p=p(p). 
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Then for small perturbations 


P —Po=%5(P— Po) (6.51) 
to first order, where 


a5 =P'(p). (6.52) 


Relation 6.51 may be viewed as the solution of the linearized form of the 
third equation in (6.49), and we turn to the linearization of the first two. 

To first order in the small quantities (p—ppo)/Po, (0—Po)/Po, U/ ao, 
and their derivatives, we have 


dp ὃ: 
5 + Pox τὸ, (6.53) 

du; dp 
ρος; + = =0. (6.54) 


(Here and elsewhere, when the unperturbed state is constant the deriva- 
tives are left in terms of original quantities to save writing or the prolifera- 
tion of subscripts, but they are transposed into derivatives of the perturba- 
tions as the need arises.) 

From (6.54), 


4, = u(x) + τς mah (p—po) dt, 


oar 


where arbitrary functions u(x) arise in the integration. It is usually 
appropriate in acoustics to take the functions u(x) to be zero; this is so, 
for example, if u,=0 initially or if the waves move out into a region at rest. 
Then the vector u is the gradient of a scalar. If we introduce the velocity 
potential φ defined by u= Vq, we have 


op dp Po ὃφ 
τε, P~ Po™ ~ Poa: OOO a2 Ot (6.55) 


and (6.54) is satisfied identically. The equation for @ is obtained by 
substitution of these expressions in (6.53); the result is the wave equation 


Pn = ASP xx, (6.56) 


for φ, with a) as the propagation speed. It may be noted also that all the 
perturbations in (6.55) satisfy the wave equation. 
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For one dimensional waves (6.56) may be solved immediately to give 
p= f(x — dot) + g(x + aot), 


where f and g are arbitrary functions; the corresponding expressions for u 
and p— pp are 


u=f'(x— dot) + 2'(x + apt), 

P~P, } (6.57) 
—— ΞΙΌ - agl)— g(x + ap). 
Po%o 

The functions f and g are chosen to fit initial or boundary conditions. We 
defer the discussion of specific examples because the full nonlinear equa- 
tions for plane waves are tractable, and some of the iinearized results can 
be seen as approximations to exact solutions. Two and three dimensional 
solutions of the wave equation are considered in Chapter 7. 

Practically any problem of acoustics takes place in the presence of a 
gravitational field and, as a consequence, the unperturbed state is not 
uniform. In problems of propagation over large distances in the at- 
mosphere or in the ocean, these effects may be crucially important and 
produce amplification and refraction of the sound waves. Even when the 
preceeding theory is adequate, it is not so much that the whole gravita- 
tional term pg is negligible, but rather that its perturbed value may be small 
compared with other perturbation terms. The undisturbed pressure and 
density must satisfy 


AP o 
P08 (6.58) 


where z is the vertical coordinate. Since the changes of pressure and the 
accelerations in sound waves may be extremely small, the two terms in 
(6.58) may be the largest terms in the vertical momentum equation. But the 
point is that they balance each other, and their further effects in the 
perturbation equations may be negligible. We consider the case of vertical 
propagation of plane waves in detail. If we set p=p,(z)+p,, p=po(z) +p), 
u= (0,0, w), in (6.49) and neglect quadratic terms and higher order terms in 
Ρ,»ρΡ1» Ww, we have 


Pip + Wo + Pow, =O, 
ρον, + Pot Piz = — PoS— P18; (6.59) 


Py,t WPo— ag(P, + WPo) =0. 
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The equilibrium entropy distribution is not uniform, in general, so that 
entropy changes must be included and a? is defined as in (6.48). 

From (6.58), the variations of the equilibrium quantities take place 
over a length scale L of the order of ας. If p, = O(ep,) and A is a typical 
wavelength in the perturbations, 


; Po €Po 
po= O( = r= (5°), 


While A/L may be 1074, the amplitude ε may easily be as low as 10 “ or 
lower, so the ambient gradients py may be greater than the gradients p,, 
produced by the sound waves. However, the terms pp and — ppg in (6.59) 
cancel, and the remaining terms are all proportional to ε. The terms 
0, 2,WPo,Wp,, have an additional factor A/L. Therefore, unless the propa- 
gation is over distances comparable with L, the nonuniform effects would 
be small. Rather than make further estimates it is simplest to look at some 
exact solutions of (6.59). 

By routine elimination of p, and p, in (6.59), and further use of (6.58), 
we find 

2\/ 
Wy = aew,, + (Poa) W,. 
Po 


This equation is hyperbolic and the characteristic velocities are + a)(z). In 
the case of variable atmosphere, ay is still the sound speed in this precise 
sense. For a polytropic gas, αζΞξ γρο ρου, So that (946) = ypo= — γροδ. 
Hence the equation reduces to 


ore ΗΝ 
Wiz = ἄρνν,; — YEW; 


Isothermal Equilibrium. 


For constant equilibrium temperature, aj is constant and (6.58) gives 
an exponential atmosphere 


RT, a 
z)=p,(O0)e~ 7/7, H= = —, 
Po(Z) = Po(9) 5 ye 
The equation for w has periodic solutions 
w= Ae?/2H cos(kz — wt), 
2 
a 
w? = azk? + Lea 


4H? 
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The variation of amplitude is small provided z« H, and w?=apk? provided 
\*/H?<«]1. This solution confirms the previous estimates. 

Convective Equilibrium. 


In convective equilibrium, the entropy is constant and pxp’. From 
(6.58), 


hence 
ag(z) = αξ(0) — (γ -- 1) gz. 


Of course this distribution is realistic only below the height a3(0)/(y—l)g. 
Solutions for w can be obtained in terms of Bessel functions (Lamb, 1932, 
p. 546), and similar conclusions about the effects of nonuniformity may be 
reached. 

Some questions concerning the refraction of nonplanar waves will be 
considered in Section 7.7; other aspects can be found in Lamb (1932, pp. 
547-561). 


6.7 Nonlinear Plane Waves 


We now consider the exact nonlinear equations for one dimensional 
flows in the case that body forces can be neglected. Since the interest is 
now in large pressure changes, gravitational effects can be wholly neg- 
lected for many applications. 

The equations in (6.49) reduce to 


p, + up, + pu, =0, (6.60) 
p(u, + uu,,) +p, =9, (6.61) 
S,+uS,=9, (6.62) 


where p(p,S) is a known function, and the last equation can also be 
written as 


p, + up, — a*(p, + up,) =0, (6.63) 
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where 


a= | a ; (6.64) 
dp S=constant 


To study nonlinear waves the equations are manipulated into 
characteristic form, following the procedure described in Section 5.1. 
Rather than quote formulas, it is quicker to develop the characteristic 
equations directly from (6.60)-(6.63). We note first that (6.62) is already in 
characteristic form, with a characteristic velocity u. Hence 


=0 on characteristics τε =U. (6.65) 
These characteristic curves are the particle paths, and S remains constant 
on each one of them. 

The other two families of characteristics are most conveniently found 
using (6.60), (6.61), and (6.63). It is sufficient to take the linear combina- 
tion: /, times (6.60) and /, times (6.61) added to (6.63). After rearrange- 
ment this combination is 


p,+(ut+l)p, + pl,(u, + uu,) + pl,u, + (1,—a*)(p,+ up,) =0. 


The choice /, = /,=0 corresponds to the characteristic form already found 
in (6.65). After that case is eliminated, it is clear from comparison of the 
terms in p and p that the only possible characteristic equations with 1,40 
require the p derivatives to be absent altogether: /, = a’. Then it is observed 
from the p and u derivatives that we require /,=/,/1,. The conclusion is 
that /, =a’, ,= +a, and the required combinations are 


DP, + (uta)p,+pa{u,+(uta)u,} =0. (6.66) 


The full set of characteristic equations can be written 


dp du _ ax 
a t Pa Gy = on Cyt =urta, (6.67) 
PM - 0 ἐὰν 2... aaa (6.68) 
dt dt τ΄ dt 
GS 9 on P-“an. (6.69) 
dt dt 


The characteristics C, and C_ represent points moving with velocity +a 


Sec 6.7 NONLINEAR PLANE WAVES 163 


relative to the local velocity u of the fluid. These are the sound waves and 
_aas defined in (6.64) is identified as the nonlinear sound speed. 
In the linearized theory, these equations are approximated as 


dp du _ ax _ 
a toto G, =9 on σεῖς; = 40 
dp du _ ax 
aS _ AX _ 
i Ξ- on P: di 0, 


and may be integrated immediately to 
(P—Po) + Podou = F(x -- aot), 
(p— Po) -- Podgu = G(x + apf), (6.70) 
S—S)= H(t). 


If entropy changes are absent these agree with the solution in (6.57). 

In the nonlinear theory the defining relations for the characteristics 
depend on the solution yet to be found, and the integration is not 
. straightforward. 

For isentropic flow S=constant everywhere, so that (6.69) can be 
dropped. Moreover, p=p(p), a*=p’(p), so that the first two characteristic 
equations can be written 


a(p)dp 
[EA + um constant on C5 ease 


p dt 
a 4 
[OS τωι on C_ Ἔχε oe) 
p dt 


These are the Riemann invariants. For a polytropic gas 
ρΡΞκρῖ, a°’=xyp™™', 


and the Riemann invariants are 


2 
y- 


τα: 4 =constant on outa, (6.71) 
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6.8 Simple Waves 


If, in addition to being isentropic, the flow has one of the Riemann 
invariants constant throughout, the solution is enormously simplified. It 
corresponds to wave motion in one direction only and in the linear theory 
would correspond to either F or G=0 in (6.70). As a basic model to 
illustrate how this type of “simple wave” may be produced, consider the 
waves produced by the prescribed motion of a piston in the end of a long 
tube. Figure 6.1 is the (x,t) diagram. Provided shocks do not appear to 
violate deductions from the differential equations, it may be argued that 
the flow must be a simple wave. For clarity, the argument is presented for 
a polytropic gas, but the extension to the more general form is obvious. 

The gas is assumed to be at rest with a uniform state v=0, a= ἀρ; 
S= S$, in x 20 at t=0, and it is assumed provisionally that shocks are not 
formed. Since the piston is itself a particle path, it is clear that all particle 
paths originate on the x axis in the uniform region. From (6.69), S remains 
constant on any particle path P, and hence it is equal to its initial value Sp. 
But the initial value is the same on every particle path. Therefore 


throughout the flow. Since the flow is then isentropic, we may use (6.71) 
for the other two families of characteristics. 

The C_ characteristics have a lower value of dx /dt than the particle 
paths, and therefore they all start on the x axis in the uniform region (see 
Fig. 6.1). On each of them 


(6.73) 


u=0, a=a9, S=So x 


Fig. 6.1. Expansion wave produced by withdrawing piston. 
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because this Riemann invariant is constant on each of them, and we have 
inserted the initial value from the region u=0, a=a,). Again, since the 
initial value is the same on every one, we have deduced that the Riemann 
invariant (6.73) is the same constant throughout. These are the arguments 
to establish that the solution is a simple wave. We now turn to the other 
characteristic equation in (6.71) to determine the remaining details of the 
solution. 

For those C, that originate on the x axis, (6.73) also holds with the 
opposite sign. Hence u=0, a=ay in the region covered by these C,. We 
deduce that the original uniform conditions apply in the region ahead of 
the characteristic C2, which separates those C, meeting the x axis from 
those meeting the piston. Since we are assuming the flow is continuous 
with no shocks, we have u=0, a=a, ahead of and on 6. Hence CY is 
given by 


X= dol. 


For those C, that meet the piston, we use (6.71) with the positive signs: 


2a 


aa +u=constant on each C, IX ag 


dt 


In view of (6.73), which holds everywhere, this may be reduced to 


y+1 


dx 
t 2 


u=constant on each C, :—— =ayt+ u. (6.74) 
The value of u on each one will be different, depending on where it meets 
the piston, but we see in general that the family of positive characteristics 
will be a family of straight lines, each having a slope ay+ {(y+1)/2}u 
corresponding to the value of u that it carries. 

The boundary condition on the piston is that the fluid velocity is 
equal to the piston velocity. Therefore if the piston path is given by 


x = X(t), the boundary condition 15 
u= X(t) on x=X(f). (6.75) 


For a C,, that intersects the piston path at time τ, it follows that u= X(r) 
along it, and then from (6.74) that its equation is 


x= X(1)+{ aot ἘΠ ΧΟ) - τ). (6.76) 
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Thus the solution is 
; > aot ae 
u=X(tT), a=ayt -5 X(1), S= So, (6.77) 


with τίχ, ἢ) determined implicitly from (6.76). 

_ Since the C, are straight lines with slope dx /dt increasing with u, it is 
clear that the ἘΝ ἀδι εἰ τς will overlap if u ever increases on the piston; 
that is, X(r) >0 for any τ. This is the typical nonlinear breaking described 
in Fig. 2.1 and it shows that shocks will be formed in such cases. If u 
increases, so do a, p, and p, so that breaking occurs and shocks appear in 
compressive parts of the disturbance. When shocks appear, we shall need to 
reexamine the arguments leading to (6.72) and (6.73) as well as discuss 
appropriate shock conditions. 

For expansion waves the solution in (6.76) and (6.77) is complete. The 
limiting case in which the piston is suddenly withdrawn with velocity — V 
is of special interest. There is a uniform region with 


as) 
u=—V, απαη-- "  Υ (6.78) 


next to the piston, and the adjustment to this from the initial undisturbed 
region is through a centered fan of characteristics as shown in Fig. 6.2. In 
the fan, since all the characteristics meet the piston at x =1+=0, they are 
given by 


+] 
v= (apt ἘΣ τα), —-V<ux<0. 


All the values of u between 0 and — V are taken instantaneously at the 
origin, but each value leads to a different member of the fan. Solving this 


Fig. 6.2. Centered expansion fan. 
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relation for u and adding the expression for a from (6.73), we have 


If the piston is pushed forward with positive velocity V, the fan is reversed 
and forms a multivalued region, which can be envisaged as a fold in the 
(x,t) plane (compare Fig. 2.3). It corresponds of course to immediate 
breaking and must be replaced by a shock. 

For other problems the simple wave argument applies in general near 
the front of any disturbance propagating into a uniform state. There is a 
region in which particle paths and one set of characteristics originate in the 
uniform state ahead, so that it is isentropic and one Riemann invariant is 
constant. The other family of characteristics “carries” the disturbance; on 
each of them, flow quantities remain constant and each of them is a 
straight line. The simple wave region extends as far back as the first 
particle path from the nonuniform region. Figure 6.1 applies with the 
piston path replaced by this bounding characteristic. The occurrence of 
such simple wave regions adjacent to uniform regions is well illustrated in 
the initial value problem of Section 6.12. 


6.9 Simple Waves as Kinematic Waves 


In Section 2.2 we saw that the continuity equation 
p,+4,=0 (6.80) 


together with a functional relation q= Q(p) leads to the simplest nonlinear 
waves. The simple waves discussed here can be viewed in this light. 
Whereas in gas dynamics there are three basic equations for the three 
quantities p, p, u, the special simple wave argument uses the two integrals 


Ν 2a _ 2a 


This means that two of the equations may be eliminated and any two of p, 
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p, u can be expressed as functions of the third. We are then reduced to one 
equation, which may be taken in conservation form, and we have a 
functional relation between the flux and the density. 
For example, if we choose to express all quantities in terms of p, the 
condition for isentropic flow provides the relation 
Y 


p=Pp = 
™ Po)” 
and the Riemann invariant provides the relation 
7 _ 2a _ 2% 
2 (y-1)/2 
S20 [2] ath (6.81) 
ΥΞῚ Po 


We may then take the mass conservation as the final equation to determine 
p. It is in the form (6.80) with g=pu. Therefore the kinematic wave 
formulation is to take (6.80) and 


q=Q0(p)=pV(p), (6.82) 


with the V(p) given in (6.81). In this case the function Q(p) is obtained 
from the other two differential equations rather than being proposed as 
part of the original formulation of the problem. But the analysis may be 
continued as in the kinematic theory. The equation for p is 


p,+c(p)p,=0,  c(p)=Q'(p), 
and we verify that 
c(p) =Q'(p) =V(e) +pV'(p) 
=V(p)+a(p) 


from the above relations. The conclusion, in agreement with the first 
derivation, is that flow quantities remain constant on characteristics and 
the characteristic velocity is u+a. 

This is essentially the approach used by Earnshaw (1858) in one of the 
earliest derivations of the solution. He assumed isentropic flow from the 
outset and wrote the equations as 


p, + up, + pu, =0, 


a*(p) 


u,+ uu, + p, =0. 
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Then, based on the observation that a linearized acoustic wave moving to 


the right has u=a,(p—p,)/pPp, he considered the possibility of exact solu- 
tions with u= V(p). The equations reduce to 


po, + (V+pV’)p, =9, 
Ne, τΣ 
(p,+ Vp,)V'+ ae ἄν 
Therefore, for consistency, 
Vs=t-, 
and the common form of the equation is 


p,+(Vta)p, =0. 


The choice of the upper sign leads to 


rio)= [ we) p= 2. (a(p) α} 


γ- 


and agrees with the above. Riemann (1858) gave the deeper argument 
presented in the last section. 

The choice of p as the working variable brings the description closest 
to the description in Chapter 2, but the formulas are simpler in terms of wu. 
We can transfer freely between the two by means of (6.81). In terms of u 
the basic equation is 


+] 
ΟΝ tu \u,=0 (6.83) 
and we have 


2 


Gas u, S= Sp, 


to determine p and p. The propagation speed is 


y+I1 


c(u) ξ αρ 5 


u. 


The solution of (6.83) is then found as before. For the piston problem, in 
particular, the solution is given by (6.76) and (6.77). 
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In the simple wave solution, at least, waves break in exactly the way 
described in Chapter 2, and the solution has to be completed by the 
introduction of shock waves. 


6.10 Shock Waves 


When waves break, realistic solutions are recovered by fitting in 
discontinuities, and we do this with the same general point of view 
developed in Chapter 2. In the region where breaking occurs, derivatives 
become large and, strictly speaking, the assumptions in (6.22)-(6.24) be- 
come inadequate there. But the real behavior usually can be closely 
approximated by introducing discontinuities satisfying the appropriate 
shock conditions and retaining (6.22)-(6.24) in continuous parts of the 
flow. Subsequently the detailed shock structure can be examined by 
including the effects of viscosity and heat conductivity. 

As noted earlier, the simple wave arguments leading to the integrals 


2a _ 2a 


δπδυ τοῖν a 


have to be reexamined once shocks are formed, and we have to return to 
the complete system of three equations for the discussion of discontinui- 
ties. 

The shock conditions are derived by the arguments presented in 
Section 5.8. It is crucial that we work with the equations in conservation 
form, and that we choose the three equations that are known to remain 
valid in the integrated form. To make the correct choice it is necessary to 
go back to the original integral formulation in (6.2)-(6.4). Specialized to 
one dimension and omitting the body force (although it would not affect 
the jump conditions), they are 


d [δι τ 
dt a 6.84 
ai Jy, 04+ Loud, 0, (6.84) 
4 τ 2. Phe 
ir J, Oude + Low? — Pri ],,=0, (6.85) 
4 Γι. ἫΝ ᾿ a 
Pdi =x pu + pe ax + 7 Pu + pe u Ριιῖ Ἔ αὶ Ξε. (6.86) 
dt J. 12 ᾿ 


Each of these is in the form (5.54) and the corresponding jump condition is 
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(5.56). After deriving the jump conditions, the assumptions p,,= — p, 4, =0, 
e=e(p,p) are again used for the continuous flow on the two sides and 
therefore may be inserted in the jump conditions. Then they become 


— U[p]+[pu] =0, (6.87) 
— U[pu}]+[pu?+p]=0, (6.88) 


(5 eu?-+ pe )u+ pu =0. (6.89) 


- u| yeu? + pe] + 


The corresponding differential equations in conservation form are 
p, + (pu), =0, 


(pu), + (pu? +p), =0, (6.90) 


( 5 ou?+ pe) + (5 ou?-+ pe) u+ pu =(. 
2 : 2 : 
These are equivalent to the set (6.60)-(6.62). 

Another conservation equation can be derived from this set: 


(pS), + (ouS),,=0, (6.91) 


which follows immediately from (6.60) and (6.62). But this does not apply 
more generally in integrated form. In fact, from (6.7)-(6.9) and (6.31), we 
have 


(Pit+P)Uu,.- Nx 


(pS), +(puS),. = 7 


(6.92) 


Hence 
d | xy ἰὼ (ρει Ἐρ)μ. -- 415 
» fi pS dx +[puS |, = " Τ dx. (6.93) 


The term on the right in (6.93) is crucially different from the source term h, 
in (5.54), since it involves derivatives of the flow quantities and there is no 
way to integrate them out. Therefore the argument leading to (5.56) does 
not apply. (It should be remembered that the assumptions p,, = —p, g,=0 
are introduced only after the appropriate limits have been taken.) Thus the 
jump condition corresponding formally to (6.91) cannot be derived. In 
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fact, we shall show later from (6.87) -(6.89) that 


~U[pS]+ [pus] +0. (6.94) 


In the discussion of shock structure the actual contribution of the right 
hand side of (6.93) will be studied in more detail. 

It is interesting that the four equations in (6.90) and (6.91) are the only 
conservation equations that can be found for the set (6.60)-(6.62). To 
prove this result consider 


af ὃ 
oF 


ot Ox ve 


where f,g,h are functions of p,p,u. If this equation is expanded in terms of 
derivatives of p,p,u, and if (6.60), (6.61), and (6.63) are used to replace the 
t derivatives, we have 


(s,—Wi,)ee+( 8, uf - 1 Wee ( Bu~ τρί, τ ραῖ, y+ h=0. 


Since this is to be an identity, the coefficients of the derivatives must 
vanish separately and h must be equal to zero. The three equations for f 
and g can be solved to show that the most general solution for f is a linear 
combination of p, pu, 4pu?+pe, pS. Thus the only independent conserva- 
tion equations are those already noted. Any three of the four can be used 
to generate a “weak solution,” but only the choice (6.90) with jump 
conditions (6.87)—(6.89) corresponds to the real physical situation. 


Useful Forms of the Shock Conditions. 


First it is convenient to write the shock conditions (6.87)-(6.89) in 
terms of the relative velocities v= U—u. With this substitution for u, they 


become 


[po] =0, 
[ p+ pv? —pvU ]=0, 


(a+ 50?) —(p+pe%) U+ + pvU? =0, 


where ἡ is the enthalpy e+p/p. By taking linear combinations, they may 
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be reduced to 


[ pv | ΞΞ 0, [ p + pv] =0, oo( + 302) τὸ 


These are the shock conditions for steady flow in a frame of reference in 
which the shock is at rest. If p,0,=p,v0,~0, the constant factor pov may be 
dropped in the third condition and we have 


P202= P10}; (6.95) 
Py + pyvz =P, + Pye, (6.96) 
Ἐπ 6.97 


The usual situation is that the flow ahead of the shock is known and 
the shock conditions are used either to determine the flow behind in terms 
of the shock velocity or to determine the shock velocity and the remaining 
flow quantities in terms of one of the flow quantities behind. We note 
explicit formulas in the case of a polytropic gas. It will be useful to include 
the expressions for the sound speed even though they follow from those 
for p and p. For a polytropic gas 


ee 


, aS 


Po aay, (6.98) 
p p 
and the required formulas are derived by straightforward manipulation of 
(6.95)-(6.97). | 
When the flow quantities are expressed in terms of U it is convenient 
to use the parameter 
_ ὕ -- μὴ 


Μ ; 
ay 


which is the Mach number of the shock relative to the flow ahead. Then 


Uy—U 2(M*-1 

iy π΄ Ξ|ς (6.99) 
ay (y+1)M 

Pr (y+1)M? 


ro (y—-1)M242” (6.100) 
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P,P; _ 2y(M*—1) 
ee = a (6.101) 


a, (2γΜμΜ᾽-(γ- 1} {(y— M242)" 


ay (y+1)M ee 


When p, is taken as known, it is convenient to introduce the shock strength 


—?p, 


ΟΝ 


Py 


and solve the shock relations in the form 


1/2 
M= ra -(1+ ay | (6.103) 
u4—Uu 
-- = 53 (6.104) 
" ie 
Y 27 Ζ 
-+- 
+5 Ζ 
ea, (6.105) 
οι ΣΝ | 
ay 
1/2 
γ΄- \ 
ee 5 
a, 
= = (6.106) 


Properties of Shocks. 


Certain important properties of shocks will be noted from these 
formulas for polytropic gases, but the qualitative results are true in general. 
First we check the condition (6.94). For a polytropic gas, S=c,logp/p”; 
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hence from (6.105) and the definition of z, we have 


=o Ξε ξεοξεοοιεει τας (6.107) 


This is not zero when z#0; the entropy does indeed jump at the shock. 

According to the second law of thermodynamics, the entropy can only 
increase following a particle. Therefore if a particle passes from side 1 to 
side 2, we require S, >S,. It may be shown from (6.107) that d(S,—S,)/dz 
>0 for y>1, z>-—1, and these always hold. Hence S,—S,>0 implies 
z>0. Thus a shock should always be compressive with p, >p,, and it then 
follows from the other relations in (6.103)}(6.106) that 


P2>P pp ρῃ θ᾽» a,>a,, uU,>U), M>1. (6.108) 


Another approach to this question of the sign of the jumps is to ask 
when shock waves are required by the breaking of waves. In the simple 
waves of Section 6.8, the propagation speed is u+a, so that the shock 
formation condition discussed in Section 2.6 becomes 


u,+a,>U>u,+a,. (6.109) 


That is, the shock is supersonic viewed from ahead and subsonic viewed 
from behind. From (6.103), it is clear that U >u,+a, requires z>0, and 
the other inequality then follows from (6.103), (6.104), and (6.106). 

The jump in the entropy at a shock, (6.107), depends on the strength 
of the shock. As a consequence, the flow behind a shock of changing 
strength cannot be isentropic. This has an important bearing on the simple 
wave argument in Section 6.8. At the same time, we should consider the 
possibility of a jump in the Riemann invariant, 


2a 
y-1 


sS= u, 

which was used in (6.73). It turns out that there is a jump in this quantity 
at the shock. On both counts, the simple wave solution would not be valid, 
strictly speaking, when shock waves occur. However, the jumps are surpris- 
ingly small when the shock strength is small or even moderate. 
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Weak Shocks. 


For weak shocks, all the expressions in (6.103)-(6.107) may be ex- 
panded in powers of z. The first few terms are 


U-u, y+ (y +1) 
] + —z- 24 Ο( 25), 
ay 4y 32γ2 ee 
2 
U, uw oz ytti ᾿ 3(y + 1) ; o(z4) 
ay Y  4y? 323 
ρβι z Y 2 3 
—_— == z“+ Ο(Ζ 
ρ᾽ Y 2? (2) 
a,—a --Ἰ 2_ 1 ~1)(y+1) 
12. red aol os oe we ieee we see Be ρον 
ay 2γ 8γ 16γ᾽ 
So-S : 
eet es ; z>+ O(z%), 
Cy l2y 
2 
52 Sy 2 @-a, u-u, 1 (yt) , 4 
a, υγ-| a, a, 32 ᾿ aoe 


In general the jumps in flow quantities are proportional to z, but the jumps 
in S and s are only O(z*). Even for moderate shocks they remain 
surprisingly small; typical values for y = 1.4 are given below. 


ὃ: 2 ay 
0.5 0.003 0.001 
1.0 0.013 0.005 
2.0 0.052 0.019 
5.0 0.215 0.085 
10.0 0.478 0.209 


For shocks of weak or moderate strength, it is a reasonable approximation 
to neglect changes in the entropy and the Riemann invariant. With these 
approximations, the simple wave solution of Section 6.8 can be retained 
and used even when weak shocks are included. 
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Strong Shocks. 


At the other extreme of very strong shocks, the asymptotic behavior as 
z—oo may be used. The most convenient form is obtained from (6.99)- 
(6.101). Usually when these are required U > u,,so we assume that M~U/a, 
and M>1. Then (6.99)-(6.101) may be approximated by 


2 P2 yl 2 7 
oS ae τς eres | σα a (6.110) 


where we have used a?= yp,/p, to eliminate both p, and a,. The expres- 
sions involve only the parameter p, from the flow ahead; u,,p,,a, are now 
small compared with values behind the shock and so are negligible in this 
approximation. It is interesting that there is only a finite compression 
P,/p,, however strong the shock may be. 


6.11 Weak Shocks in Simple Waves 


As noted previously, it will be a good approximation when the only 
shocks occurring are weak or of moderate strength to retain the relations 
γ-! 
S= Sp), a=ayt —>—u (6.111) 
in a simple wave. Then, as we saw in (6.83), the remaining equation may 
be written 


y+1 
u,+c(u)u, =0, c=atu=ayt——u. (6.112) 
In addition to eliminating two equations, the relations in (6.111) already 
satisfy two of the shock conditions approximately and there is just one left 
to accompany (6.112). Since this treatment is approximate, there are many 
choices that agree to lowest order. As we saw in Chapter 2, the most 
convenient one to use with an equation like (6.112) is 


Cy +c 
ae ae, (6.113) 


which reduces to 


+] | 
U= (ay + uy +a, + uy) = a+ —— (uj + uy) (6.114) 


1 
2 
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in this case. From the weak shock conditions, 


2 
i, ge ee, Ie 


z7+ Ο(25), 
ay 4y 32y? (2) 


2 
a,—a u.—Uu + | + | 


2 a, a; 4y 16? 


so that (6.114) is true to first order in z, but not to second order. This 
assumption is not as accurate as (6.111). One could relate U to u, and u, 
by an expression that is accurate to second order in z, but the extra 
complication in the shock fitting is usually not worth it. 

The problem of shock fitting is essentially the same as in Chapter 2 
and, with the simple form (6.113), follows closely the account in Section 
2.8. 

We could still consider the initial value problem for (6.112) but this is 
only a special case of the full initial value problem in gas dynamics; it 
applies only if the prescribed values satisfy (6.111) so that the flow 
produced is a simple wave. It is more natural here to consider again the 
piston problem with 


u=g(t)=X(t) on x=X(p), 


and complete the discussion of Sections 6.8 and 6.9 in the case that shocks 
are formed.* The solution is 


u=g(7), (6.115) 


xa X(1)+{ aot #(1) | (0-7), (6.116) 


where the characteristic variable + is chosen so that t=7 on the piston 
path. Shocks will be required on characteristics with g(r)>0. We could 
relate the problem to the initial value problem by extending the 
characteristics back to the x axis as in Fig. 6.3 and taking 


f= X(r)- {a+ Ye er) \r, F(£) = ay+ ΤΣ 9(1). 


Then the results of Section 2.8 could be applied. It is perhaps clearer, 
however, to proceed directly. 


*The formulas will be given in a form that covers also the more general case in which the 
prescribed g(?) is not equal to X(z). 
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Fig. 6.3. Shocks produced by a piston motion. 


We treat the case of a front shock propagating into an undisturbed 
region with «=0 ahead (Fig. 6.3). If the shock path is x= s(t), the shock 
condition (6.114) gives 


ds_ ΥΙι 
ah =A)+ 4 »(τ), (6.117) 
and the characteristic equation (6.116) gives 
+1 
ΓΕ {a5 Ye 40) Gay (6.118) 


These two equations are solved to find s and ¢ as functions of the 
parameter τ and thus determine the shock. Since X(t)/ay<1, X(1)/ag<7, 
and ¢>>7 at the shock, it is sufficient in the shock calculation to approxi- 
mate (6.118) by 


+1 . 
s(1)={ ay+ 1 a(1) | age (6.119) 
Then we have 
ds _ y+1 y+1. dr 
dt = dyt τ 46): 5 g(r)t ao} dt’ 
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Comparing this with (6.117), we have 


y+ Y+1.. ὧν ar 
dag on 8 ae a 
This may be integrated to 


+] 


Fag Oe {κ(γατ΄. (6.120) 


Relations (6.119) and (6.120) determine the shock. 
If g(0)>0, the shock starts immediately at the origin. The relation 


between 7 and ¢ is 


so that 
+1 


The shock starts with velocity ay+{(y+1)/4} 9(0); this checks with 
(6.114). 

If the piston comes to rest so that g(r)0 as t->7,, the asymptotic 
behavior is 


+1 7 
τς - εἴ). [ 'g(t)dr 
0 


440 


: 1/2 
s~agt + {(7+ 1)ay [“a(r)ar}  £°/?— ary 
0 


4a ae 
ἐξ ἘΝ eas πο ον ~1/2 
u= g(r) Al [ ΠΣ) fares 


The shock path is roughly parabolic in the (x,f) diagram and the shock 
Strength decays like ¢~'/*. Between the shock and the limiting 
characteristic T= 7), we have 


+] 
χα ων (7) γε agro 
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so that 


Further shocks can be handled similarly. If the piston returns to its 
original position at t= 7 and remains there, the asymptotic behavior is a 
balanced N wave with shocks at 


1/2 


x=ag(t—7)* {C+ Daof “e(rar} 1". 


and 


2 χ-αρί(!-- το) 
ἜΣ  - fice Searle 
y+1 t 
between them. 
Other special cases and limiting results can be derived following the 
arguments in Section 2.8. 


6.12 Initial Value Problem; Wave Interaction 


The simple wave is a disturbance propagating on one family of 
characteristics. In general there will be waves on all three families. The 
complete initial value problem will require all three, unless the values 
prescribed already satisfy the simple wave relations. If the initial values are 
uniform with u=0, a=ad,), S=S, except in the range a<x <b, the (x,?) 
diagram is as shown in Fig. 6.4. There is an interaction region adceb, but 
then, provided shocks do not occur, the disturbance separates into three 
simple waves as shown. Their existence is established by noting that in 
each of them two of the characteristics originate in the initial uniform 
region. 

Between the simple waves, all three characteristics originate in one or 
the other of the uniform regions; hence they are uniform states with u=0, 
a=4d,, S=Sy. Once the interaction is solved, the simple waves can be 
described analytically (similar to Section 6.8) with boundary conditions 
provided by the interaction region. 

If the entropy is initially uniform everywhere, the flow is isentropic. 
The P wave in Fig. 6.4 drops out and the interaction region is limited to 
abc. 
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Fig. 6.4. Initial value problem in gas dynamics. 


In practice, the interaction region is best solved numerically. However, 
some analytic simplification can be made in the isentropic case, and a 
related problem was solved completely by Riemann. For isentropic flow, a 
may be used in place of p and the equations may be written 


a, + ua, + au, Ξε, 
(6.121) 
u, + uu, + yx = 9. 


Equations such as these can be transformed into a linear set of equations 
by interchanging the roles of the dependent and independent variables: the 
so-called “hodograph” transformation. In this transformation, 


_ & _ ας u, 
-τ’ στ: ες ye Τ᾽ 


x 


Qa 
where J is the Jacobian: 
“πα, — a,u,. 


Since the equations (6.121) have one and only one derivative in each term, 
the Jacobian cancels through and the equations become 


χ, τ : 5 at, =0, 
Xu aS ee πῶς 
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These are now linear equations for x(u,a) and t(u,a). Notice it is crucial 
that J cancel through; otherwise the equations would still be highly 
nonlinear. 

A single equation for t(u,a) may be obtained if x is eliminated by 
cross-differentiation. Then we have 


2 2 
y=! Ὑ:- 1 ie, os 
[ 5 ) tat is 


With b=2a/(y—1), n=(y+1)/2(y— 1), the equation becomes 


2n 


th, + 
bb b 


th = μι" 

For n=1 this is the wave equation for spherical symmetry, which has a 
relatively simple general solution. Indeed, when n is any integer, the 
general solution is not too difficult and may be written 


po (8- e) { Aeryeee | (6.122) 


du2 db? b 


where F and G are arbitrary functions. It is fortunate that the two 
interesting cases of y= and y= are included as the cases n=2 and 
n=3, respectively. The Riemann invariants are u+b, so the basic 
characteristic structure is apparent in this expression. 


In the linearized approximation to (6.121), that is, 


Ω, Ἔ Att, = (), u,+ Al =(), 


the general solution is 
u=f(x— aot) Ὁ δ(χ - ἀρ), 


2 
γ-1 


b— bo= (a— αν) =f(x— aot) — σ(χ- apt). 


The nonlinear solution takes a form relating to the inverse of this. 
For the initial value problem, a and u are prescribed functions 
a=@(x), u= U(x) at ¢=0. 


In principle, by elimination of x, this defines a certain curve 4 in the (6,1) 
plane, and we have t=0, x given, on §. These two boundary conditions 
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are sufficient to determine the solution in the appropriate characteristic 
triangle. In practice, however, it is simpler to use a numerical solution in 
the (χ, ἢ) plane, as noted previously. 

Simple waves have either u+ b or u—b constant, so the mapping into 
the (u,b) plane is singular; the entire simple wave region in the (x,t) plane 
is mapped into a line in the (u,b) plane. The interaction of two simple 
waves is then formulated in the (u,b) plane as requiring the solution with f¢ 
prescribed on characteristics 


The determination of F,G is easier in this case. The solution can be found 
also for general y; the details are given in Courant and Friedrichs (1948). 
This analysis is limited to shock-free flows and seems to be mainly of 
academic interest. In view of this, the details are not documented here. 


6.13 Shock Tube Problem 


One special initial value problem, which includes a shock, can be 
solved exactly and very simply. It is also important since it refers to the 
main device for producing shocks in experimental studies. A shock tube is 
a long tube with an end section partitioned off by a thin diaphragm. The 
gas held behind the diaphragm is pumped up to a high pressure. The initial 
state is two uniform regions with 


P=P) P=? in x >0 


= 
I 
= 


and 


u=0, P=Pa>Pi P=P, inx <0. 


The diaphragm separating the two uniform initial states is burst to produce 
a shock propagating down the tube. If the viscous effect of the side walls 
of the tube are ignored so that this may be treated as a plane wave 
problem, and if the solution is limited to times before the waves are 
reflected from the ends of the tube, the exact solution may be obtained 
analytically. 

The (x,t) diagram is shown in Fig. 6.5. The interface separating the 
two gases moves down the tube, there is a compressive shock moving into 
the gas on the low pressure side, and an expansion wave moves into the 
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Fig. 6.5. The (χ, ἢ diagram for a shock tube. 


high pressure gas. Since the parameters provided by the initial conditions 
do not specify a basic length or time, and the lengths of the tube sections 
are irrelevant in the early stages before the disturbances reach the ends, 
dimensional arguments show that the solution must be constant on the 
lines x /a,t=constant in the (x,t) plane. Therefore the velocities of the 
shock and the interface must be constant, and the expansion wave must be 
a fan centered at the origin. There are uniform regions 1, 2, 3, 4 as shown; 
regions | and 4 are still in the original uniform states. The problem may be 
viewed as the combination of two piston problems, using the interface as 
an effective piston. The fluid velocities on the two sides of the interface 
must be the same as the velocity of the interface, so it is like a solid wall as 
far as the flow on either side is concerned. However, its motion must be 
determined as part of the problem and cannot be prescribed in advance. If 
the velocity of the interface is V, the shock conditions (6.103)-(6.105) may 
be used with u,=V, u,=0, to determine p,, p,, U in terms of V. In 
particular, the shock strength z=(p,—p,)/p, is determined from (6.104): 


ee ee (6.123) 


The gases on the two sides of the diaphragm may have different values of 
y. We use y, for the gas ahead and y, for the gas behind. The expansion 
wave between regions 4 and 3 is a simple wave with 


For a polytropic gas, S=c,logp/p™, a7=y,p/p, so that these two rela- 


186 GAS DYNAMICS Chap. 6 


tions can be solved to find also p,p,. In particular, p, is determined from 


(γ,-- 1)72γ4 
a 
| pene ee /1-(22) | (6.124) 


P4 


If required, the details of the flow in the centered simple wave can be 
calculated; the solution is similar to (6.79) but on the other family of 
characteristics. 

At this stage, the solution is completely determined in terms of V. The 
final relation is to enforce the condition p,=p,, since the interface has no 
mass and cannot sustain a net force. These pressures are determined from 
(6.123) and (6.124), and they provide the equation for V. The result of 
most interest, however, is the shock strength z. If we substitute p,=p, 
=p,(1+z) in (6.124) and equate the two expressions for V/a,, we have 


> a (γ4,-- 1)72γ4 

Z 

— -Ξ ες —4 (+z »| 

| yl \\” ee ae 
1+ Zz 

Ἢ 27; 


This is the equation for z in terms of the known quantities p,/p,, a,/ aj. 


6.14 Shock Reflection 


The reflection of the shock from the end wall can also be solved 
exactly. The normal reflection of a plane shock from a plane wall can be 
analyzed from the shock conditions. Let subscripts 1 and 2 refer to the 
states ahead of and behind the incident shock, and subscript 3 refer to the 
state behind the reflected shock. If the incident shock strength is denoted 
by z;=(p.—P;)/P}, the state 2 is determined by (6.104)-(6.106) with z = z,. 
The reflected shock has state 2 ahead and state 3 behind, so that if the 
reflected shock strength is z,=(p3;—p )/p2, then from (6.104), with suitable 
changes in subscripts and change in sign of the velocities because the 
reflected shock travels in the opposite direction, we have 


uy τ U4 Ζ, 


a (14 y+1 \ 
Z 
1+ 5-2, 


Next to the wall the gas must be at rest; hence u,=0. But we also know u, 
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and a, in terms of z,, so this is a relation for z, in terms of z;: 


Ζ; Ζ, 


oP ae 1/25 
Gang eee 
Y i 2γ 2; 2γ 4 


This leads to a quadratic for z, and the relevant solution is 


Ce ξος τ Ξε ὡς (6.125) 


For weak shocks, z,>0 and (6.125) is approximately z,~z,; hence 
P3—P\~2( P2.- Pi); 


and the pressure increase at the wall is approximately twice that in the 
incident shock. For strong shocks, z,->00, we have 


2γ P3 37-1 


—, = ὃ for y = 1.4. 
y=) Py YO! : 


6.15 Shock Structure 


In accordance with the general point of view that evolved in Chapter 
2, a shock is interpreted as a thin region in which rapid changes of the flow 
quantities occur. It is a discontinuity in one level of description, but it is 
replaced by a thin region in a more accurate level of description. As a 
check on this, particularly regarding the correct choice of conserved 
quantities, and in order to estimate the shock thickness if necessary, we 
consider the shock structure in the special case of a transition between two 
uniform states. 

For one dimensional flow, the equations for conservation of mass, 
momentum, and energy are 


p, + (pu), =9, 


(ou),+(ou?—py1),=0, (6.126) 


( 50? pe) + [ (50+ e)ou—pi.u+ ar} =0. 


x 
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An improved description over the one used so far is to take the Navier- 
Stokes relations for the stress p,, and the heat flux 4), while retaining the 
assumption of local thermodynamic equilibrium. The assumptions are that 
P\, depends linearly on the velocity gradient and q, depends linearly on the 
temperature gradient. They are noted in general in equations (6.28) and 
(6.29); for one dimensional flow they reduce to 


x 


4 
Pu=—Pt τ μι, 415: —AT,. (6.127) 
The thermodynamic relations 


1 P 
e= ----- -- = ἢ oT 6.128 
y—lp mae ( ) 

complete the system for ἃ polytropic gas. 
For the shock structure solution the flow is steady relative to the 
shock. Hence all flow quantities are functions of X=x-— Ut alone. For 


such functions 


0 d 0 d 


ὃ dX”~—sx.s aX 
and the equations (6.126), being in conservation form, may be integrated 
to 


— Up+ pu=A, 


- upp Syay)= 2, (6129 


= (5:90? + pe) + (5+ e)pu + pu— $ wy AT, | =C, 


where A, B, and C are constants of integration. As X-> + 0, the flow tends 
to a uniform state denoted by subscript 1. The constants A, B, and C are 
determined then in terms of U,u,,:,,p,. If the flow also tends to a uniform 
State μ2.ρ2.} as X—>— οὐ, it is clear that the states at + oo are related by 
the shock conditions (6.87)-(6.89). 

The relations (6.127) may also be used to explore further the equations 
for the entropy change. We may now write (6.92) explicity as 


4/3) puy + (ATy) Χχ 
4 (gus) = “mkt Oe 
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or, better still, 


4 AT) (ἀ)βγμιᾷ +073 
fe Pw u)S+ Τ' \- T : 
Hence 
© (4/3) pur +ATZ 
[o(u-wshi= [ hele Sat oN 


— co 


The entropy change across the shock is clearly seen to be a con- 
sequence of the dissipation of energy by viscosity and heat conduction, 
and the sign of the entropy change is automatically predicted. 

The details of the shock profile between the limiting values at + 00 are 
provided by the ordinary differential equations in (6.129). With v= U—u 
and new constants related to A, B, and C, they may be put in the form 


po= Q, 


pv*+p+ poy = P, 


(n+ 50 )pe+ Ἐμοῦ +ATy = Ε.. 
These are the steady flow equations in a frame of reference moving with 
the shock, but with v measured positive in the negative X direction. The 
Shock conditions relating the uniform states at +oo are now in the 
corresponding form (6.95)-(6.97). 

The continuity equation pv=Q and the relations in (6.128) may be 
used to reduce the system to two equations for Ὁ and T. For a polytropic 
gas, 


Ύ 
h= se νι ἐπὶ 1 


and it is a little more convenient to work with v and ἢ. The equations are 


—] 
$uoy=P—o[o+ A) (6.130) 


λ 4 i 7 
ee Ἔ μου = E~ O[h+ “Ὁ ) (6.131) 


P 
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It is a simple matter now to show qualitatively that a solution of the 
required form exists. In the special case A/c,= 4p, which is a good 
approximation for air, an integral can be found and the solution exhibited 
explicitly. (The quantity pc, /A is the Prandtl number, and it is 0.71 for air 
at normal temperatures.) For this value of A/c,, (6.131) can be written 


4 ba) aig 1 Ἵ 
τ μ(λ: 50°) Ε O[h+ 50") 


The right hand side vanishes as X00, that is, h, +4v0?=E/Q. Therefore 
the only solution that is bounded as Χ-» -- οὐ is 


h+ i, = = 
2 Q 
throughout. In this case, h+4v? is not only the same on the two sides of 
the shock; it remains constant throughout the shock. Equation 6.130 may 
then be written 


4 ¥Ch VLE 
= uvy = P— QO| ——v+ ——-=- |. 
a οἱ dy Ty a 


Since the constants must be such that the right hand side vanishes for both 
v=v, and v=v,, it must be possible to write it 


4 y+1_ (v,-—v)(v- 02) 
Ὥς aaa ea 


3 Bex 2γ 


This is easily integrated to give 


0 Ἐς SC a 
ἐμ. 2y X οι τ og(v— v2) D1 — Ὁ: og(v,—v). 


We have Q=9,0,, so the shock thickness 15 proportional to 


pad eat Ge 
3p, y+ 1 v,— 0) 


As expected it becomes thinner as » decreases for fixed strength and also 
as the strength increases for fixed p. 
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6.16 Similarity Solutions 


The simple wave solution is limited to plane waves moving into a 
uniform region. The problems of cylindrical or spherical symmetry and of 
plane waves moving into a nonuniform region are more complicated. A 
fairly general approximate theory for weak waves can be obtained (and 
this is developed in Chapter 9), but there are also some special exact 
solutions, which fit more conveniently into the account in this chapter. 

We consider first cylindrical or spherical wave motion. The equations 
(6.49) reduce to 


ρ, + uo, +o u, += |=0, (6.132) 
u,+ uu, + τρ,-ῦ, (6.133) 
p, + up, — a*(o, + up,) =0, (6.134) 


where r is the distance from the center and j=1,2 for cylindrical and 
spherical waves, respectively. 

The characteristic equations are nearly the same as in Section 6.7; the 
extra term jpu/r does not involve derivatives and therefore does not 
change the appropriate choice of the linear combinations. The 
characteristic equations become 


Cae gee | A 
dp » dp dr 
ea ae oe 


The innocuous-looking extra term in (6.135) invalidates the simple wave 
argument. In isentropic flow, the C_ characteristic equation becomes 


d{ 2 QU GP Ὁ 
(a u)+i2 = (), 7 μ--α. (6.137) 


This can no longer be integrated once and for all to give the simple 
relation between a and u. As a consequence there are no exact solutions 
corresponding to the simple waves of plane flow. One can proceed by 
means of certain approximate methods to get analogous solutions, but they 
are limited to weak disturbances. These approximate theories are taken up 
in Chapter 9. 
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However, turning to another approach, we can find a different class of 
exact solutions which are surprisingly useful. The set of equations (6.132)— 
(6.134) has special similarity solutions in which all the flow quantities take 
the form ¢”f(r/t"). These have the simplifying feature that the partial 
differential equations reduce to ordinary differential equations with inde- 
pendent variable r/t”. 


Point Blast Explosion. 


One of the best known similarity solutions describes the blast wave 
produced by an intense explosion. It was found independently by Taylor, 
Sedov, and von Neumann in connection with atomic bomb research. Its 
form can be argued on dimensional grounds. It is supposed first that the 
explosion can be idealized as the sudden release of an amount of energy E 
concentrated at a point, and that this is the only dimensional parameter 
introduced by the explosion. Second it is supposed that the resulting 
disturbance will be so strong that the initial pressure and sound speed of 
the ambient air are negligible compared with the pressures and velocities 
produced in the disturbed flow. Then the only dimensional parameter 
relating to the ambient gas is the density pp. In particular, the strong shock 
relations (6.110) apply; that is, 


-- Ὁ) _yti ey! 2 
u= ——— U, p= 7 Po P=T Fy Pol” (6.138) 


behind the shock moving with velocity U. 

The dimensional argument is based on the fact that the only parame- 
ters in the problem are E, with dimensions ML?/T?, and po, with di- 
mensions M/L?. The only parameter involving dimensions of length and 
time is E/py with dimensions L°/T* or some function of it. Let us now 
consider various quantities that arise in the solution. The flow is headed by 
a shock at r= R(¢). Since R(t) is a length, the only possible form for its 
dependence on ¢ is 


1/5 
R(t)= 42} 12/5, (6.139) 
0 


where k is ἃ dimensionless number. It then follows from the shock 
conditions (6.138) that the pressure and velocity immediately behind the 
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shock are 
2 1 
8. po (Ε΄. ἀν ΤΕΥ Σ 
pa ΕῚ ,- 675 σε σα πο 98/5 
25 y+1\ po 5 y+1\ po 
or, equivalently, 
1/2 
8 k τὰ ἅ Χ721Ἑ 372 
= — =— =) κα 6.14 
Po oe al Νὰ τὶ et) 


It is striking, as usual, that such valuable information can be obtained at 
the outset from a simple dimensional argument. 

The argument can be taken further to show the functional form of 
u,p,p in the entire flow field. Since there are no independent length and 
time scales provided by the parameters in the problem, only the combina- 
tion E/p, with dimensions L°/T’, any dimensionless function of r and ἢ 
can depend only on the combination {= Et’ / ρο or some function of it. 
We shall in fact use 

r 
which is proportional to ¢ —'/5 from (6.139). Then, for example, ut /R, 
p/P pt?/poR* are all dimensionless and must be functions of ξ alone. 
Following Taylor (1950) we take 


2 
μ- 2 Fo, ρβρψῷ, »-Ἐξ 5) "Ms (614) 


the factor of 2 is included because 2R/St is the shock velocity. There are 
other equivalent forms and the choice 


2 
2 2, 
=22y(@), p=m%(@), p= ($1) por) 
5 t 5 t 
fits a general scheme discussed below. Clearly the connection is 


p=tV, w=Q, f=yé’P. 
The shock is at €=1 and the shock velocity U is R=2R/5t, so the 
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shock conditions are 

[Ξ Ὁ je. goa (6.142) 

φί ΞΞ τ τ’ ψ yl? at . ° 

When the expressions (6.141) are substituted in the equations of motion, 
three first order ordinary differential equations are found for the functions 
φ(ξ), ψ(ξ), f(€). These have to be integrated from €=1 to €=0, with initial 
conditions (6.142). The parameter k does not appear in the equations. It 15 
fixed from the definition of E as the total energy in the flow. That is, we 


require 
R(t) 
= a ,ἘΕ 1 2 2 
Ε J (25 + So Joan dr, 


_ 5 2 : | l 2 2 
1 -- 4πκ (=) i Fesn +509 " ἀξ. 


The functions 9,y,f in Fig. 6.6 are taken from Taylor’s numerical integra- 
tion of the equations in the case y=1.4. It was shown by Sedov, using 
different variables, that the equations could in fact be solved analytically; 
reference should be made to Sedov’s book on similarity solutions. (1959, 
Chapter 4). 


which leads to 


f and φ 


0.2 Φ 2.0 
ο 
0.2 04 O06 O8 1.0 


Fig. 6.6. The normalized velocity φ, density y, and pressure f in the point blast explosion 
(Taylor). 
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Similarity Equations. 
The point blast similarity solution is one of a family in which 


uan"V(O), ρ- προς PO) 
(6.143) 


p= pp2(§), f= (ct)"" 


If these expressions are substituted in (6.132)-(6.134), and the forms 


0 Vinny all op) 
2 O=Es'O= - 27, 


2 =8F0= FO, 


are substituted for derivatives of functions of ξ, the factors in r and ὦ 
cancel through and leave ordinary differential equations in € alone. They 
are most conveniently written in terms of V, A=(yP/Q)'/? and Q; the 
sound speed is 


a=n-A(é). 
The equations are 

(Ψ- η᾽- aye οἰ εὴν- ᾿-1,.5. γιν-ην- 
PAE =F IV (v-1(v- 1} 
(6.144) 

{(V- ya} 5 =f 2-1 '}ar4 τ νίν- =) 
- ρσευνν-ουητ-ην- 4) (6.145) 

((y- 47} 5B =a i+ NU- = κν-η a 


= γν- *)-U+ ἡγίν-- ἡ. (6.146) 
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There is a possible singularity if 


(V-1)’-A?=0, (6.147) 


and this would be a singularity on a curve €=constant. The original 
equations are hyperbolic so we know that singularities can occur only on 
the characteristics 


ἼΘΙ’ ΠῚ (6.148) 


On ἃ curve €=constant, we have 


dr _ nr 


dt τ᾿ 


Therefore a curve €=constant which is also a characteristic must have 
V+A=1. This agrees with (6.147). Thus, a singularity can occur only on 
the characteristic through the origin, and this limiting characteristic is one 
of the family £=constant. | | 

In the point blast problem, n=2/5, the. limiting characteristic does 
not appear in the flow behind the shock. Its position would be in the 
region ahead of the shock, as shown in Fig. 6.7, but the flow is taken to be 
uniform there, with V= A =0, and there is no singularity. It represents the 
edge of the fold in the (7,t) plane in the multivalued solution which is 
replaced by the shock. 


Guderley’s Implosion Problem. 


The limiting characteristic plays a crucial role in the problem of an 
incoming spherical or cylindrical shock collapsing to the center. In this 
case there is no dimensional argument to establish that the solution must 


Fig. 6.7. The (x,7) diagram for the point blast explosion. 
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Fig. 6.8. The (χ, ἢ diagram for Guderley’s implosion solution. 


be a similarity solution. However, Guderley (1942) proposed that it be of 
the above form (6.143) for some exponent n to be determined. The origin 
of t is taken to be the instant at which the shock reaches the center, so that 
t<0 and C<0 in (6.143). One might argue for a similarity solution on the 
grounds that there must be a singularity at the center with a shock coming 
in on some curve in the (r,7) plane and being reflected back along another 
curve. This perhaps suggests that the solution is related to a family of 
curves coming into the origin, and the family r/(—1)"=constant is the 
simplest to try. In any event, it works! The equations are then as derived 
above (6.144)-(6.146) for some n to be determined. The incoming shock 
may be normalized to be at 


r —- 


(Ct)” 


ἐξ 


3 


since C is an adjustable parameter. 

In this geometry, it is clear from an outline of the (r,t) plane (see Fig. 
6.8) that the limiting characteristic through the origin is in the flow region. 
And the question of a singularity on it becomes most important. In 
integrating (6.144)-(6.146) for V(€), P(é), Q() from their values at €= 1, the 
curve (V~—1)*— A*=0 will be reached. There would then be a singularity 
in the solution except in the special case when the right sides of (6.144)- 
(6.146) also vanish. Obviously a nonsingular solution is required, and the 
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TABLE 6.1 
Cylindrical j= 1 Spherical j =2 
Y n (l—-n)/n n (l—n)/n 
5/3 0.815625 0.226054 0.688377 0.452692 
7/5 0.835217 0.197294 0.717173 0.394364 
6/5 0.861163 0.161220 0.757142 0.320756 


exponent ἡ has not yet been specified. Guderley put the two together and 
proposed that the value of n should be the one that makes the right hand 
sides of (6.144)-(6.146) vanish at the curve (6.147) and allows the solution 
to be continued smoothly across the limiting characteristic. 

The numerical integration has been repeated with great accuracy by 
Butler (1954) and the values found for n are presented in Table 6.1. The 
pressure at the shock and the shock velocity are given by 


por τ m/n Uap τη). 


the exponent (1—17)/n is also presented. It is intriguing that the exponent 
(1—n)/n for j=2 is nearly double the value for j = 1. One is tempted to try 
arguments to prove that this should be exactly true. An approximate 
theory, described later (Chapter 8), gives this result automatically, but the 
proposed result does not appear to be exactly true. 

Guderley also showed that the reflected shock can be fitted into the 
same similarity solution. Of prime interest is the increase of strength on 
reflection. At the center both the incoming and outgoing shocks have 
infinite strength and velocity in this idealized model, but the ratio of the 
strengths remains finite. Guderley shows that for y=7/5 the ratio of the 
pressure behind the reflected shock to the pressure behind the incident 
shock is about 26 for spherical waves (j=2) and about 17 for cylindrical 
waves (j= 1). These compare with the ratio 8 for plane waves (see Section 
6.14). 

The infinity at the center would be modified by viscous effects in any 
case, but a more important question concerns stability. It will be shown 
later that an approximate theory predicts that small, unsymmetrical deriva- 
tions in the shape of the shock would grow and the shock would not focus 
perfectly at the center. However, it seems that this instability affects the 
behavior in only a small neighborhood of the center, and over much of the 
motion Guderley’s solution applies. 
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Other Similarity Solutions. 


On dimensional grounds one can argue that the flow produced by a 
uniformly expanding sphere must be a similarity solution with u,p,p 
functions only of r/Ct, where C is the velocity of the sphere. The 
equations are (6.143)-(6.146) with n=1, and in this case the similarity 
solution is not limited to strong shocks. This was proposed and solved by 
Taylor (1946). 

In some ways, problems of plane shocks propagating through a 
nonuniform density distribution p(x) are analogous to spherical or cy- 
lindrical waves. It turns out that there are some corresponding similarity 
solutions. Sakurai (1960) investigated cases in which p,(x)x x” and found 
that they could be analyzed like the implosion problem. In particular, the 
exponent in the similarity variable is found by suppressing a possible 
singularity on the limiting characteristic through x =0. (See Section 8.2.) 

Other cases, including an exponential density stratification, have been 
studied by Sedov, (1959, Chapter 4), Zeldovich and Raizer (1966), and 
Hayes (1968). 


6.17 Steady Supersonic Flow 


Problems of steady supersonic flow can also be treated by the 
methods developed for unsteady waves. There are, in fact, close analogies 
between problems in the two fields. Two dimensional steady flow corres- 
ponds to unsteady plane waves, axisymmetric steady flow to cylindrical 
waves. 

If body forces are neglected, the equations for steady flow may be 
written 


cs 
v(54 )+oxaq= - 1p, (6.150) 
q:VS=0. (6.151) 


These are taken from (6.49) with minor changes in form. It is convenient to 
use vector notation, to use q for the velocity vector (since later in two 
dimensional flow the components will be denoted by u and v) and to 
replace the original expression (q- V)q on the left of (6.150) by the equiva- 
lent expression shown, where w=curl q is the vorticity. 
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Now the thermodynamic relation (6.31) can be taken to be 


PiSSais te 
p 
and used with (6.150) and (6.151) to deduce the equations 


V(5a+h)+oxq=TVS, (6.152) 


a-V(5@+h)=0 (6.153) 


Therefore, from (6.151) and (6.153), the entropy S and the “total enthalpy” 
h+4q? remain constant along the streamlines in any continuous part of 
the flow. If a continuous flow comes from a uniform state S= Sp, ἢ ΞΞ Πρ. 
q= U, at infinity we have 

1 1 


h+ 54 =hyot ZU", (6.154) 


S=S, (6.155) 


throughout the flow. These relations have to be reexamined if shocks 
occur, since the quantities may jump discontinuously as the streamlines 
cross a shock and the jumps may be different on different streamlines. But 
we accept them for the present. Then (6.152) reduces to 


w xX q=0. (6.156) 


We are interested only in two dimensional or axisymmetrical flows; for 
these, ὦ and 4 are orthogonal so the conclusion is w=0, that is, the flow is 
irrotational. In general, there are special flows—the so-called Beltrami 
flows—which satisfy (6.156) with w40. 

For a polytropic gas, h=a*/(y—1) and Bernoulli’s equation (6.154) 
may be used in the form 


εἴ 
a?= a3— “—— (q?—U?) (6.157) 


to express a in terms of q. If the flow is also isentropic p and p can be 
determined from a and therefore from q. 

We now consider a continuous two dimensional flow with uniform 
conditions upstream. The flow is taken to be in the (x,y) plane with 
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q=(u,v). Since all the thermodynamic quantities are known in terms of gq, 
via (6.155) and (6.157), we need two equations for u and v. We may choose 
(6.149) and the irrotational condition #=0. The remaining equations are 
then automatically satisfied by the various integrals used to arrive at this 
point. We have 


(pu), + (pv), =0, 
v,— u, =0, 


where p is to be expressed in terms of u and v by (6.157). The relation 
between a and p is a7~p*~!: hence 


τ΄ Oe Ὸ΄ «ὦ 


ἀρ 1 d(a*) _ udut+vdv 
ρ γι] sa? a 


The equations may therefore be transformed into 


(u?—a’*)u, + 2uvu, + (v?—a*)v, ΞΟ, (6.158) 
υ, — u, =0, (6.159) 


where a? is given by (6.157). 


Characteristic Equations. 


The next step is to investigate the characteristic forms, following the 
procedures of Chapter 5. The manipulation, although elementary, is a little 
complicated and a variety of alternative tricks may be used to keep it as 
short as possible. We choose to start in a straightforward way and consider 
the linear combination 


2. 2 2. 032... ὦ 
(u°—a*)u, + (2uv—1)u, + lv, + (υ΄ -- a*)v, =0. 
This is in characteristic form 


(u?—a?)(u,+ mu, ) + 1(v, + mv, ) =0, 
provided that 


(u*—a*)m=2uv-1 and Im=v?—a?. 
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The requirement on m 15 
(εὖ — a*)m? —2uvm + (οἷ -- a”) =0. (6.160) 


It has two real roots if u?+v?>a*. Therefore the system is hyperbolic in 
regions where the flow is supersonic. The corresponding characteristic 
equations may be written as 


: ad 
(u?— a?)m& + (0? a?) 95 =0 on <= m. (6.161) 


Since only two variables are involved, the differential form 
(u? — a*)mdu+t (v*—a*)dv (6.162) 


can be integrated for each choice of m, and two Riemann variables can be 
obtained. The procedure is clear, but it is at this point that a little 
ingenuity (combined with knowing the answer!) can be used. The guide 
here is symmetry. 

Since m is the slope of the characteristic, (6.160) can be expressed as a 
relation between the differentials dx, dy, on the characteristic and taken as 


(u? — a*) dy? — 2uv dx dy + (v* — a*) dx” =0, 
or, better still, 
(udy —vdx) =a?( dx? + dy’). 


If x is the inclination of the characteristic to the x axis, and @ is the 
inclination of the streamline, we have 


dx =cos x ds, dy =sinx ds, 
u=qcos8, v=qsin8. 
Then the differential relation reduces to 
q’ sin? (x— 0) =a’. (6.163) 


But a is a function of g, so if we introduce a variable p defined by 
sinu= 7 O<p<%, (6.164) 


the characteristic condition (6.162) becomes simply 


X=O+p. (6.165) 
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The characteristics make angles + μ with the stream direction. The quant- 
ity μι is called the Mach angle and is related to q by (6.164) and (6.157). In 
view of their significant roles, we now work towards the two dependent 
variables μ, 0, in place of either q, θ or u, v. 

The remaining problem is to transpose the differential relation (6.162) 
for the Riemann variables. On a characteristic, (6.162) vanishes; therefore 
(6.160) can also be used to relate du and dv. The relation is 

(odv+ udu) = a?(du? + dv’). 
In terms of g and θ, it becomes 
2 dq’ = a?( dq? + 42403), 
that is, 
- 1.2 
d 
do + [Ξ -1} “ =0. 
a q 


Therefore the Riemann variables are 


θΞ- P(p), 


: 1/2 

q dq 

P(w= | (—-1] — 

(1) [{ ᾿ 2 
7 Cos” p j 
sin2u+(y—1)/2 


Ἴ | +] +] 
= I an a ang} (6.166) 


Finally, the characteristic equations are 


where 


6+ P( ») =constant on C4: =tan(o+ μ). 


(6.167) 


θ-- P( μ) =constant on cP =tan(9- 1). 


204 GAS DYNAMICS Chap. 6 


co 


Fig. 6.9. Expanding supersonic flow around a continuous corner. 


Simple Waves. 


The special solution in which one of the Riemann variables is constant 
throughout the flow will again be called a simple wave. It arises in the 
study of flow around an expansive corner as indicated in Fig. 6.9. 
Upstream of the corner the flow is uniform, with w= py, @=0, say. The C_ 
characteristics all start in this uniform region and therefore 


P( 1) — 0 = Ρ(μο) (6.168) 


on each of them. As a consequence, this Riemann variable is the same 
constant throughout the flow. Then, from the equations for the C,, » and 
θ must be individually constant on each C,, and each C, 15 a straight line 
with slope tan(@+ μ). Since = 8, is given at the wall in terms of the wall 
shape y = Y,(x), the solution may be written 


θ--θ,(ξ), Ρ(μ,) = PC Mo) + 4,» 


y=Y,(€)+(x—&)tan(@, + w,). 


(6.169) 


Fig. 6.10. The centered Prandtl-Meyer fan for supersonic flow around a sharp corner. 


Sec 6.17 STEADY SUPERSONIC FLOW 205 


There is a close analogy with the piston problem, both with the solution 
(6.76)-(6.77) and its derivation; the correspondence 15 


Vox, χερί, for. 


All quantities of interest may be calculated from the expressions for μ 
and @. Of particular interest is the pressure at the wall and this requires 
only the Riemann invariant (6.168); the value of μι, is determined in terms 
of @, and the pressure is related with p» by the relation 


7 -- ἢ 
P (ay 1+ (y= 1)/2sin? Hp | iil 
Nag | L4(y—1)/2sin2 pn 


In the limiting case of a sharp corner, shown in Fig. 6.10, the simple wave 
becomes a fan (the Prandtl-Meyer fan) and the solution in the fan is given 
by 


(6.171) 
tan(0+p)= ΔΝ 
X 

When the corner is compressive, multivalued regions develop due to 
the convergence of characteristics (as shown in Fig. 6.11) and a discon- 
tinuous shock must be fitted in. The extreme case of a sharp compressive 
corner is shown in Fig. 6.12. In general, once a shock appears, the various 
integrals along the streamlines and along the C_ are no longer strictly 
valid, since the quantities concerned jump discontinuously at the shock. 
However, this situation is closely analogous to the situation for the corres- 


Fig. 6.11. Shock formation in supersonic flow. 
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P2,P2,92 
Py» Py 


Fig. 6.12. Shock formation in supersonic flow past a wedge. 


ponding unsteady problems treated earlier. For weak shocks the various 
integrated relations are true to first order and the simple wave solution is 
an approximation valid to first order in the strength of the disturbance. It 
is completed by fitting in shocks appropriately. 


Oblique Shock Relations. 


The shock conditions will be required for the oblique shock shown in 
Fig. 6.12. These are easily obtained from the normal shock relations in 
(6.95)-(6.97). If the flow in Fig. 6.12 is viewed by an observer moving with 
speed g,cosB along the shock, the flow on side 1 will appear to be normal 
to the shock. Then (6.95)-(6.97) give the normal jump relations with 
0, = q,sinB, v,=q,sin( 8 —@). Moreover, in this moving frame, the flow is 
one dimensional; therefore the flow on side 2 must also be normal to the 
shock. Hence q, cos 8 = q, cos( B — 8). The complete set of shock conditions 
may be written (in a slightly more general form to allow for an inclination 
θ᾽ of the upstream flow to some arbitrary reference direction) as 


ργᾳχϑὶηί B— θ2) =p,9,sin( B—9,), 
P2+ 0293 sin’( B-9,)=p, + p,q] sin”( β- θ.), 


q,cos( B—9,)=q,cos( β -- θ}). (6.172) 


1 5. ee 
h,+ 5 928i’ ( B—0,)=h, + 5. 41 Sin’ ( β-- θι). 


The shock conditions can also be derived directly from the steady flow 
equations in conservation form. Then they are related to conservation of 
mass, normal momentum, tangential momentum, and energy, respectively. 
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It might be noted that ἢ Ὁ 1 47 is in fact continuous, so that the argument 
leading to (6.154) is still valid. However, the entropy and the Riemann 
invariant do jump. 

The shock conditions determine p,, p,, 42» β in terms pj, p,, 4.» 9, and 
they can be used to solve the problem shown in Fig. 6.12 exactly. If the 
wall curves around after the sharp corner, as for a supersonic airfoil, we 
may use the simple wave solution as an approximation when all the values 
of θ are small. Three of the shock conditions are satisfied to first order by 
the simple wave relations; the remaining one is used to fit in the shock. It 
may be shown from (6.172) that 


~p,+0 
B= n+ 2 — + 0(67). 


Therefore, to first order in 6 and p,—p,, the shock bisects the angle 
between the characteristics on the two sides. This corresponds to (6.113) 
and the shock fitting follows very closely the similar steps in the unsteady 
case. The procedure will be seen later in the discussion of the more 
interesting case of the shock produced at the nose of an axisymmetric 
body. 


Oblique Shock Reflection. 


Finally, we note for later use that if an oblique shock is reflected from 
a plane wall, a possible local flow pattern is as shown in Fig. 6.13. If the 
initial uniform state and B are known, the shock conditions between 
regions 1 and 2 can be used to determine all the flow quantities in 2. Then 
in region 3, 8 is known, since it must take its original value parallel to the 
wall, so the shock conditions applied to jumps from 2 to 3 are again 
sufficient to determiine the remaining flow quantities in 3, together with the 
reflected shock angle β,. 

An important result of this analysis is that the proposed solution 
covers only a certain range of cases. If the shock is weak enough or at a 


Fig. 6.13. Regular shock reflection. 
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Fig. 6.14. Mach reflection. 


sufficiently glancing angle, there is no solution and what happens is that 
the range of reflected shocks available can not turn the flow in region 3 
back parallel to the wall. The whole pattern gets pushed away from the 
wall and assumes the pattern with three shocks shown in Fig. 6.14. It is 
known as “Mach reflection” in honor of Ernst Mach, who first observed it 
experimentally. The analysis of it is still incomplete with some theoretical 
results in apparent disagreement with observations. 


CHAPTER 7 


The Wave Equation 
The equation 
2 2 
at = ¢’V’q, c =constant, (7.1) 


has become known as the wave equation even though the majority of 
waves are not governed by it! However, it does occur in many problems 
and it is the simplest equation for starting the discussion of two and three 
dimensional waves. There is an enormous number of possible topics and 
we must make some choice. Following the general theme of this book, we 
restrict the discussion to basic results which contribute to an understand- 
ing of waves and play a role in extensions to nonlinear theory. We make 
no attempt to give even an introduction to the vast amount of special and 
intricate analysis developed for the various boundary value problems of 
diffraction theory. The elementary aspects of interference and diffraction 
patterns are well documented in a variety of books and the more advanced 
theory rapidly becomes a matter of skillful use of “mathematical methods” 
rather than one of understanding the nature of waves more deeply. 

On the other hand, the approximate theory of geometrical optics 
involves valuable general ideas which can be extended to other contexts 
both for linear and nonlinear problems. It is developed here for the wave 
equation and the extensions for nonuniform media and for anisotropic 
waves are noted. These extensions go beyond the strict discussion of (7.1) 
but the material fits in conveniently here. Other aspects of geometrical 
optics and the development of similar ideas in nonlinear problems are 
considered in later chapters. 


7.1 Occurrence of the Wave Equation 


The wave equation (7.1) occurs primarily in three fields: acoustics, 
elasticity, and electromagnetism. 
209 
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Acoustics. 


The equations for acoustics have been given in Section 6.6. The 
expressions are noted here for easy reference. The gas dynamic equations 
are linearized for small perturbations about a constant state in which 


u=0, ρξρ, P=Po=P(pp). 
The propagation speed is 
45 =P'( po); (7.2) 


and in terms of a velocity potential » the perturbations are given by 


u=Vq, (7.3) 

P—Po= — PoP (7.4) 
Po 

P~ Po= — 5; (7.5) 
ao 


Substitution in the linearized continuity equation leads to the equation for 
φ: 


Pn = αὐ. (7.6) 


Linearized Supersonic Flow. 


The acoustic formulation may be used when the disturbance is caused 
by a moving solid body. If the disturbance is to remain small, the motion 
of the body must be very small (this applies to the cone of a loud speaker, 
for example) or the body must be very slender. The former is a typical 
source for sound waves and the equation must be solved subject to 
appropriate boundary conditions. The case of a slender body moving with 
arbitrary constant velocity relates acoustics to aerodynamics. If the body 
moves with constant velocity, there is an obvious advantage in transform- 
ing to a moving frame of reference fixed in the body. Let (X),X>,X3) refer 
to the original frame in which the motion of the gas is small and described 
by (7.3)- (7.6). If the body moves with speed U in the negative x, direction, 
and (x,y,z) refer to coordinates fixed with respect to the body, the 
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transformation of coordinates is given by 
x=x,+Ut, y=Xy, Z=X3. 


The velocity components in the new frame are (U+u,,u,,u,) where 
u,= dg /dx;. Moreover, the flow appears steady in the new frame so that 


P(X}, Xz, X3t) =O(x,y,Z) 


= @( x, + Ut, x>, x3). 
Therefore (7.6) becomes 
(M2-1)®,,=0, +6 yee (7.7) 
xx yy zz? Ay 3 . 
(7.4) becomes 


P—Po= ~ Po U®,; (7.8) 
and the velocity components relative to the body are 
(U+9,,®,,®,). (7.9) 


For supersonic flow, M>1, we recover the wave equation in a 
reduced number of variables with x playing the role of time. This is a 
linearized version of the analogy noted in Section 6.16. 


Elasticity. 


The derivation of the wave equation in the elementary treatments of 
transverse vibrations in strings and membranes, and of longitudinal and 
torsional waves in bars, is taken to be well known. Here we note only how 
the wave equation arises in the full three dimensional theory. 

The motion of an elastic solid can be described in terms of the 
displacement &x,f) of a point from its position x in the unstrained con- 
figuration. It will also be convenient to introduce X(x,?)=x+&x, ἢ) for the 
new position at time 1. The forces acting across a surface in the deformed 
body can be described in terms of a stress tensor p,,, just as in the case of a 
fluid (Section 6.1). If we temporarily consider the stress as a function of the 
current variable X in the deformed configuration, the stresses produce a 
net force dp,,/dX, per unit volume. This follows from the divergence 
theorem, exactly as in the case of a fluid. However, the preceding 
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“Lagrangian” description of the displacements (which is usually more 
convenient in elasticity) relates all quantities back to the original un- 
strained configuration. Accordingly, the net force per unit volume of the 
unstrained configuration is 


i (7.10) 


where J is the Jacobian 


_ 9(X1,X2,X3) 


-τ----΄“- (7.11) 
θ(Χ»Χ2,Χ3) 


Moreover, 0x,/0X, is J,/J, where J, is the cofactor of the element 
dX,/dx, in the determinant (7.11). Therefore the net force per unit un- 
Strained volume, (7.10), is 


RSL re (7.12) 


The extension of any line element from the unstrained to the strained 
position is obtained from 


I 0 i 
dX? — T= a ax, dx, dx;,,— dx; 
= 2€, dx, ἄχ: 
where 
1 [ 0X; 9X; 5 
“κ " ] Ὅχ, Ox, δ 
1/ 9& 0g dg οξ; 
Ἔν a he ae ee, αοδι δος Ὁ ἢ 7.13 
A Se + se «τῇ OX, 


In general, the stresses p,, depend on the strains εἰ and the temperature. 
In the linear theory of elasticity for small displacements ¢ from an 
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unstrained configuration, the equations are linearized as follows. Since 
Ti = Oi + O(V8), (7.12) is linearized to 


0 7, op Ji 


— = 7.14 
Po 912 ax, 9 ( ) 
the strains (7.13) are linearized to 
wif 2% , Ὁ. 
ey = | ax, + ax, |? (7.15) 
the stress-strain relations are taken to be 
Pg = 2 Me + λεχκδὶ» (7.16) 


where A, μ are the Lamé constants. Strictly speaking, different constants A, 
μ should appear, for example, for isentropic and isothermal motions, but 
the difference is small for most materials. (A good elementary discussion 
of the thermodynamics is given by Landau and Lifshitz, 1959, p. 8.) 
From (7.14)-(7.16), the three equations for the displacements ξ, are 


2¢ 


ἫΝ a = pV + (A+B) eo é). (7.17) 


The divergence and curl of (7.17) lead to 


ὃσιθ- "τ “τ VV ὃ, (7.18) 


 wxg=tvyvxKg, — (7.19) 
912 Po 


respectively. Thus there are two modes, each satisfying a wave equation; 
(7.18) describes compression waves propagating with velocity {(A+2p)/ 
Po} !/? while (7.19) describes shear waves which have velocity { 1/po}'/”. 
The two modes are coupled through the boundary and initial conditions 
placed on the @ or the p,, and the full solutions to problems involve 
considerably more than just solving the wave equation. 


Electromagnetic Waves. 


Maxwell’s equations for a nonconducting medium with permeability u 
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and permittivity ε may be written 


9Bivxe=0, cwHalyxp 
Of ἐ ὃ μ 
γ.ΒΞ-0, V-E=0, 


where B is the magnetic induction and E is the electric field. Therefore 


2 
OBL yy yxpy=tvep, 
εμ εμ 


and E satisfies the same equation. All the components of E and B satisfy 
the wave equation with the propagation speed c=(eu)~'/?. However, 
choosing the components to satisfy V-E=0, V-B=0 couples the com- 
ponents, as do the boundary and initial conditions, so the solution to 
problems is again more than just the solution of the scalar wave equation. 


7.2 Plane Waves 


For one space dimension x, the wave equation is 
ΒΕ 
Pi = © Pxx: 


If characteristic coordinates a= x — ct, B=x+ ct are introduced, it reduces 
to 


02 
Bey 
da 0B 
and the general solution is 
φΞ7}(α) Ἐ5(β) 
=f(x—ct)+g(x+ct), 


where f and g are arbitrary functions. The arbitrary functions are readily 
determined to fit prescribed initial or boundary conditions. For the signal- 
ing problem of outgoing waves with* 


9,=Q(t), x=0, 


*The boundary condition is taken with prescribed p, rather than g to compare with the 
source solutions in spherical and cylindrical waves. 
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the solution Τὸ 
mee ΒΕ} 
φΞ ἐρ! ᾿ ) (7.20) 
where Q,(f) is the integral of Ο(). For the initial value problem, 
φξφρίχ), B= (x), t=0, —wK<x<oo, 


the solution is 


w= 5 {φοῦ - et) τ oolxtet)} Ἐς f φι(ξ)άξ. (521) 


7.3 Spherical Waves 


For waves symmetric about the origin p=q(R,‘), where R is the 
distance from the origin. The wave equation reduces to 


Surprisingly enough, this may also be written 


1 9°(Re) _ 9*(Re) 


c* ὃι2 dR? 


3 


which is the one dimensional wave equation. Thus the general solution 
takes the simple form 


_f(R=¢t) " g(R+ct) | 


Ξ τ (7.22) 


For a source producing only outgoing waves, the solution is 


_ f(R-ct) 
ὩΣ al 


and f is determined from the properties of the source. A convenient 
standard form is to prescribe 


| ὃφ 
es ᾿ 2 e 
QO(t)= him 4πᾺ DR? (7.23) 


216 THE WAVE EQUATION Chap. 7 


this gives 
Q(t) = —4nf(—ct) 
and 
t—R 
φΞΞ- -- —. (7.24) 


In acoustics, dp /OR is the radial velocity and Ο( is the volume flux of 
fluid. 

For an initial value problem, although it is merely a matter of 
determining the functions f and g in (7.22), the solution is more interesting 
than might have been expected. Consider the “balloon problem” in acous- 
tics: the pressure inside a region of radius R, is py>+P while the pressure 
outside is po, the gas is initially at rest, and the balloon is burst at t=O. 
From (7.3) and (7.4), the initial conditions may be formulated as 


Therefore the solution 


_ S(R= at) ze g(R+ aot) 


R R (7.25) 
must have 
J(R) + 3(R)=0, 0<R<o, 
Zk O2RSR: (7.26) 
f(R)—a'(R)=} Pot 
0, Ry< R< oo. 


These conditions determine f and g for positive values of their arguments. 
However, in the solution (7.25), the values of f are also required for 
negative argument. The remaining condition comes from the behavior of 
the solution at the origin. Since there is no source at the origin, we require 
; 2 °F _ 4. 
a OR 
hence, 


f( — aot) + 8 (aot) =0, 0<t<oo. (7.27) 
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This condition determines f for negative argument in terms of the values of 


g for positive argument. 
Solving (7.26) and (7.27), we have 


κο-{ #206 oe 
0, Ry< él, 


L. 
H=| ae me 
0, Ry<é. 


Finally, the solution for the pressure disturbance 15 


P 
P—Po= ap (R- aot) F+ (Rt aot) G }, 
where 


7 | l, —Ry<R—aot < Ry 
0, otherwise, 


=| l, O<R+t+at< Ro, 


0, otherwise. 


The variation of pressure with time is shown in Fig. 7.1. For a point 
R > R,, a discontinuous pressure increase equal to PRo /2R arrives at time 
t=(R—R,)/d; the excess pressure then decreases linearly in time to a 
value — PR,/2R at t=(R+ Κρ))ᾶαρ and it then returns discontinuously to 
zero. Even at R=R,, the discontinuity at the front of the wave is only 
P/2; the remaining P/2 to make up the initial discontinuity P is taken by 
the incoming expansion wave. 

For interior points R<R,, a discontinuous decrease in pressure, 
reducing the initial value P to P(1— R,/2R), arrives at time t=(R)— R)/ 
a); the excess pressure then decreases linearly with time to — PR)/2R at 
t=(R,+R)/ay and then returns discontinuously to zero. Notice at the 
center R=0 these changes are infinite but the whole disturbance lasts for 
zero time! | 

It is interesting that an entirely positive initial distribution of pressure 
leads to an outgoing wave with equal positive and negative phases. In fact 
this N wave profile is typical in two and three dimensional waves. The 
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Fig. 7.1. Pressure signatures in the balloon problem. 


reasons for it can be understood as follows. In an outgoing wave the 
pressure and radial velocity are given by 


Po%o [(Κ -- aot) 
ie cae meee 


_ f(R- apt) = S(R — apt) 
oe ae 


The first point we can make is that in any wave that returns both p — p, and 
u to zero after the whole wave passes, both f’ and f must return to zero. 
Hence, f” has to have both positive and negative parts if the total integral, 
which is f, has to return to zero. A second point concerns the total volume 
flow at large distances. For large R, the volume flow across a sphere of 
radius R is 


4nR*u~4aRf'(R— aot). 


This is large in R. If f’, which is proportional to the pressure, were always 
positive, there would be an infinitely large outward flow as R->co. An N 
wave, however, has a large outward flow immediately followed by a 
balancing large inward flow, so the net outflow is finite. 

For plane waves neither of these effects arises and a positive source 
provides a wave with wholly positive p—p, and u. 
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7.4 Cylindrical Waves 


Hadamard pointed out in his classic studies of the wave equation that 
the general character of the solution is different in even and odd numbers 
of space dimensions. Precise instances will appear below, but we might 
express the result crudely by saying that odd dimensions are easier than 
even. For this reason the three dimensional spherical case was considered 
first and the cylindrical wave solution will be deduced from the spherical 
wave solution. Here only the solution for outgoing waves will be obtained. 

We start from the solution (7.24) for a point source. Suppose such 
sources are distributed uniformly on the z axis with a uniform strength q(¢) 
per unit length (see Fig. 7.2). The total disturbance from this distribution is 
clearly a function only of the distance 7 from the z axis and the time /; it is 
the cylindrical wave produced by a line source. The total disturbance is 


— 1 f° at-R/ejdz 1 * 4([-- R/c)a 
on)=— ae f Rk ἯΙ R τ’ 


where R= Vr7+22. 


q(t)dz 


Fig. 7.2. Construction detail for a line source. 


Various forms of this solution are valuable. If z=rsinh{, R=rcosh¢ 
are substituted in the integral, we have 


] ioe) 


ss ae at 
oS 5. a(t Σ coshs ) αἵ: (7.28) 


R Ἢ 27° 
ἰ-- - ΞΞ : = aoe =: = 
z n z=c\/(t—n) he 


alternatively, if 


220 THE WAVE EQUATION Chap. 7 


we have 


t—rfc d 
(dame [ ome (7.29) 


Formula 7.28 is particularly useful for calculating the derivatives of φ 
and therefore for verifying directly that the wave equation is satisfied. It is 
easily shown that 


‘(0+ 19) -e,= τὶ “Ales ἡ: 
(¢,+ +9, Dit τ a E sinh q(t “ cosh) | ak 


= ee {,.. ἢ 
= Kim { sinh q'(z π cosh) 


2ur 


If q’(t)0 sufficiently fast as t+ — oo, for example if g is identically zero 
until t=0, this is zero. 

For a periodic source q(t)=e~“", we satisfy the condition g(t)30 as 
t—+>—oo by allowing ὦ to have a small positive imaginary part, which 
makes insignificant changes at finite times. From (7.28), the solution for 
the periodic source is 


l = (wr /c)cosh¢ — iwt 
SS - τς Nor d lw , 
φ τ] e Ce 


It is just 
p= — H9( = Je, 


since the integral is one of the representations of the Hankel function. This 
solution would be obtained more simply from the equation by separation 
of variables. The Fourier superposition of such elementary solutions pro- 
vides another derivation of (7.28). 

The first derivatives of φ describe important physical quantities (pres- 
sure and velocity in acoustics); from (7.28), followed by the substitution 
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coshf=c(t—7)/r, 
t—rfe 
ee Oe ce ee ΝΕ 4. (η) η 
cba tomyent | 
ies ( 


ως ἢ ἤ) 
φ, Ξ στε coshSq'(t “ cosh) df 


(7.30) 
{- 4 (η) dn 


r 
-ω (ι-- η΄ - “2, 


These formulas can also be derived directly from (7.29) if judicious 
integration by parts is used to avoid the divergent integrals or if Hada- 
mard’s “finite part” of an integral is used. The latter is incorporated into 
the version using the theory of generalized functions. 


Behavior Near the Origin. 


From the final integral in (7.30) it is easy to see that 
Ll fig as 
O~ 5 [ 4 (n)dn=Z_ q(t) 5 γ-ρῦ. 


Hence the flux per unit length of the line source is 
lim 2nrg,= q(t), 


which checks our definition of q(t). This gives 


q(t) 


ὙΠ 2π 


logr, 


but often the next term in the expansion is required. The expression in 
(7.29) may be integrated by parts to give 


ee: 0 ee τ τὰ a aaa |. 


— © 


t—rfe 
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If we now approximate this for small r by 


ae a 2c(t—n) r 
o~— 9g feos a — 0, (7.31) 


it may be shown by careful estimation that the errors are proportional to r. 
The expression for φ, is the same as (7.31) with φ(η) replaced by ΄ (ἡ). 


Behavior Near the Wavefront and at Large Distances. 


If g(¢)=0 for t<0, the lower limit in (7.29) may be taken to be zero, 
and the solution is nonzero in t—r/c>0. The first signal arrives with the 
wavefront t—r/c=0. If we introduce 


r 
==> 
Cc 


to measure the time elapsed after arrival of the wavefront, (7.29) may be 
written 


-4 { es _ - δι )δη. 8 r=t—->0, 
V(r—n)(r—942r/c) : 


Since ἢ ranges from 0 to τ, the second factor under the radical may be 
approximated by 2r/c when cr/r<1. Hence 


: ἡ d 1/2 
eee TATE , Ue 


27 i ντ ἢ r 
That is, 
1/2 7 
o~-(<) O(t—r/c), ar «l, (7.32) 
where 
1 ( adnan 
O(r) an ΞΞΞΩ (7.33) 


Sec 7.4 CYLINDRICAL WAVES 223 


The expression (7.32) may be compared with (7.20) and (7.24). In all three 
cases we have an amplitude falling off like r-“~/?, where n is the 
dimension of the space. In the present case, however, the formula is not 
exact, and Q is not simply the source strength. A simple interpretation of 
the dependence of the amplitude will be given in Section 7.7, equation 
7.70. 

The expansion in (7.32) can be continued by noting that 


; ᾿Ξ ἐν. 
y=- i 1 fal zy" [1 +e | Pe 


1) 
1 ¢7 4(η) ΤΣ ε(τ-- ἢ) 
᾿ an, (τ ae) 2, ει ) ar } 4 
2. (m=172)!. OG) 7 τς: CT 
αν τ: (-- κι-- 1.Χ2}} m! Gl : ie Ue.) 
where 
(—1/2)! ὦ ¢t m-1/2 
Ont) = Te Fg J, 4 my  αη. (7.35) 


It is interesting that if g(0+)>0, so the source switches on with a finite 
strength, 


Ο, (τ)ᾶ τ δ τοῦ, (7.36) 


Thus the expansion in time after the wavefront passes is in one half 
powers. 


Tail of the Cylindrical Wave. 


One of the important differences between odd and even dimensions, 
noted by Hadamard, is in the behavior of the solution for a source which 
lasts for only a finite time. Suppose φᾷ) is zero except for the time interval 
0«:-« 7. For plane or spherical waves, we see from (7.20)* and (7.24) that 
the disturbance is confined to 


Spe aT and ee ear ὅς 
Cc ς Cc ς 


*For (7.20), φ' was prescribed, so by “disturbance” we mean the quantities φι and φ,. The 
fact that p may be a constant not equal to zero in t> x/c+ T is not counted as a disturbance. 


224 THE WAVE EQUATION Chap. 7 


respectively. The first signal arrives with the wavefront which left the 
source at t=0Q; this must be true in all cases. The interesting point is that 
the disturbance ceases with the signal which left the source at the final 
time t= 7. For the cylindrical wave, (7.29), an integral over the source 
strength q(t) is involved and the disturbance continues after t=r/c+T. 
We have 


T 
1 q(n) dn 


᾿ 
oS —ae -- »-- -ἘΤ. 
2 
ὦ 0 γί:-- η)΄ -- "2)ε2 = 
For fixed r, 
Ll 7 ] 
τως ca a(n) dy | ?? [> 0o. (7.37) 


The disturbance decays to zero only asymptotically as too. 


7.5 Supersonic Flow Past a Body of Revolution 


The most interesting use of the cylindrical wave solution is probably 
in supersonic aerodynamics. As noted in (7.7), the perturbation velocity 
potential satisfies the two dimensional wave equation with 


χεϑί, Μ1-- lot . 
C 
For a body of revolution, (7.7) becomes 
ΒΦ, πῷ, +8,  B=VM?=1, 


where r is distance from the flight path and x is distance from the nose of 
the body. The solution is zero for x < Br and 


x— Br 
1 4(η) 


On 


ᾧΦ-Ξ x > Br. (7.38) 


0 (x—1) — B? 


The source strength q(m) is related to the shape of the body. The boundary 
condition on the body is that the velocity normal to it is zero. Hence if the 
body shape is given by r= R(x), 


®. = R’'(x)(U+®,) on r=R(x). 
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For the linearization in the equations, the body must be slender, that is, 
R(x) is small, and ®, and ®, are both small. Accordingly, the boundary 
condition is linearized to 


® = UR’(x) on r=R(x). (7.39) 
But ®.~q(x)/2ar as r-0, therefore (7.39) gives 
q(x) =27UR(x)R’(x) = US’(x), 


where S(x)=7R7*(x) is the cross-sectional area of the body at a distance x 
from the nose. Intuitively one can see that US’(x) is the rate at which the 
increasing cross-sectional area is pushing fluid out, and this is the source 
strength. The solution for the given body therefore is, 


x— Br 
5΄ (η) ὦ 
ose. Sa, gS. (740) 
2a 2 2.2 
0 (x-—17) —B*r 


The components of the velocity perturbation are obtained by suitable 
modification of (7.30) as 


x— Br 
δ΄ (η) ὦ 
ee Gee (7.41) 
0 (.-- η) - B?? 


χ-- Br is 
mt (x—9)S"(a) ἢ (7.42) 
0 


In linear theory, the pressure is given by (7.8). However, an interesting 
question arises here about the consistency of the linear theory, particularly 
with regard to the pressure. The exact expression for the pressure in 
potential flow is given by Bernoulli’s equation [see (6.157)] as 


y¥/(y¥-)) 


2y¥/(y-)) ss] 
2 -(£) ={1-7— (Uo,+ 52+ 50) 
Po \4o a 2 2 


Therefore, since a2 = ypo/ Po, 


PO 2. 
Po 


lao ἷ ἢ wr 
(Uo,+ 502+ 59 Ἔσται 
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When r is not small compared with the length of the body, ®, and ®, are 
comparable small quantities being of order 5? in the thickness ratio of the 
body 6 (defined as maximum diameter divided by length). Then the 
linearization in which ®2, 6?, are neglected is correct. However, on the 
body r= R(x)=O(6), and for small r, 


US'(x) ον US"(x) 


” Qur ᾿ = 2a 


log r. 


r 


Therefore on the body 


6=0(8), Φ.- O(log. 


Apart from the log(1/6), which in practical situations in not really large, 
the term 4? is as important as the term ®,.. It appears then that one 
should take | 

a Sg Mem (7.43) 
Po 2 


rather than (7.8), for a good approximation to the pressure. Lighthill (1945) 
and Broderick (1949) showed by careful consideration of higher approxi- 
mations that (7.43) is correct with an error O(6*log?1/8). At the same 
time the consistency of the linear theory must be questioned, since the 
boundary condition is applied in the region where ®, and ®, are not of the 
same order. It is shown in the references cited that (7.41) and (7.42) are the 
correct lowest order terms, and the adoption of the nonlinear relation 
(7.43) is the only essential change. 


Drag. 


The drag on the surface of the body due to the perturbed pressure is 


p= f ΓΕ ΠῚ" 


where the integration extends over the length / of the body. The expression 
for ® near the body is 


U tf. 2(χ-- ἡ) 
o~- ¥ i S”(n)log—~—™ a, 
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[see (7.31)], and the pressure is given by (7.43). Hence the drag can be 
written 


Ι 
2am [ 50} - ϑ'Ολιοεκο) 
ροῦ 0 


ἢν Ὁ 2(χ -- ἢ) ι S?(x) 
autho " pat ee Bu Ss. a. 
+ al S” (y)log 3 dn 4π R(x) x 


The first and third terms combine into 


ἰ 12 
- | $2) S"(s)loe RC) +3 () R(x) ἢ 
0 


R(x) 


since §’=27RR’, and this 15 


- [ £(}s7 een Jarno 


for a body with S’(0)= S’(/)=0. After integration by parts, the second 
term gives 


D ροῦ" s” S"“(n)l ΠΡ 
- f of (nog at and 


Po U* 


l l 
wh "(x » Ϊ x 
= Por [fs (x)8"(nlog aa, (7.44) 


[The term in log(2/B) integrates to zero.] This is the famous supersonic 
drag formula first obtained by von Karman and Moore in 1932. 


Behavior Near the Mach Cone and at Large Distances. 


The wavefront is x — Br=0; this is the Mach cone making an angle 
sin~'1/M with the x axis. When (x — Br)/ Br<1, we have from (7.32) and 
(7.33), suitably transcribed to supersonic flow, that 

ξ 
S’ 
as 0) Fe. é=x-— Br. 


2aV2Br 0 νξ--η 
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Hence the velocity components are 


UF(x—B F(x—B = 
Ces, cee Gay 
| V2Br V2Br Br 
where | 
ξ 
1 S”(n) 


ο νξτ-η 
In this limit, ®, and ®, are the same order and the pressure is given to the 
first order by 
P—Po_ F(x- Br) 
ροῦ" V2Br 


(7.47) 


It should again be noted that the behavior near the wavefront and at large 
distances can be combined in one expression. 

If the body is sharp nosed with R’(0)=e, then S(x)~e2x? for small 
x, and we have 


F(€)~2¢7€!/? as é 0. (7.48) 


Figure 7.3 is a typical F(g) curve. The appearance of negative phase is 
typical, even for a shell shaped body for which the source strength US’(x) 
is always positive. In fact it is easily shown that 


[ “Ε(θ4ξΞο, 


~ 


Fig. 7.3. Typical F curve for axisymmetric supersonic flow past a body. 
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and the physical explanation in terms of mass flow is similar to that given 
for spherical waves at the end of Section 7.3. 

It may be remarked that the velocity components and pressure are 
continuous at the Mach cone according to this linear theory. In reality a 
shock wave is produced, and we have the important phenomenon of the 
sonic boom. This is missed because it is a nonlinear effect. The theory for 
it is developed in detail in Chapter 9. 


7.6 Initial Value Problem in Two and Three Dimensions 


One of the many “Poisson integrals” appearing in the theory of partial 
differential equations provides the solution of the wave equation for initial 
conditions 


P=H (x), P= 9,(x), ¢=0. 


According to Hadamard’s ideas, the three dimensional problem will be 
easier than the two dimensional, and we start with it. 

We know from the spherical wave solution discussed in Section 7.3 
that 


Wut) πο 


is a solution for arbitrary € We now argue intuitively that the initial 
prescribed disturbance at any point € gives rise to such a spherical wave 
and propose that the solution should be something like a superposition of 
all the spherical waves. That is, we consider 


W(x, t) -{ vas (7.49) 


The arbitrary function (€) is inserted because, depending on the initial 
conditions, the spherical waves from different points € will have different 
strengths. The form of (7.49) suggests the introduction of spherical polars 
(R,9,) based on the point x. It then becomes 


(x,t) =" ff Wet RDA R—et) RsinddR ddr, 
0 0 “Ὁ 
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where I is the unit vector from x to and its Cartesian components may be 
written 


1= (sin@cosA, sin# sinA, cos@ ). 
With the idea that the initial source strength determining f acts only for an 
instant, we specialize this formula to the case f(R—ct)=6(R-—ct). Then 
T pan 
W (x,t) = ct [ [ ¥(x+ctl)sin6d0 dx. (7.50) 
0 “Ὁ 


Formally, this is a solution for arbitrary Ψ. It may also be written as a 


surface integral 
ψα) τα f VdS, 
cl JS(t) 


where S(f) is the surface of the sphere with origin at x and radius ct. For a 
continuously differentiable function V, we see from (7.50) that 


y—0, ψ,.-94πϑ (x) as t—0. (7.51) 


If we choose V(x) = ,(x)/47, we shall have solved the special initial value 
problem 


Yy>0, ψ,-» φ,(Χ) as t—0. (7.52) 
The solution is 


ψίχ,1) = yp, dS. (7.53) 


a S(t) 
It represents the total contribution of the instantaneous sources which send 
spherical waves to the point x in time ¢; they are all exactly a distance ct 
away and their contributions traveling with speed c arrive at x just at the 
time ¢ (see Fig. 7.4). Notice that all points inside S(t) could still in 
principle have been contributing. But there is no “tail” for spherical waves: 
the sources act for an infinitesimal time and each contribution lasts only 
for an infinitesimal time. This will not be so in two dimensions. In any 
event, (7.53) is formally the solution for the initial values (7.52). It may also 
be written 


W(x,t)=ctM[q,], 
where 


] 
MNGi 4πο 12 “ΟἽ 
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Ὁ 


S(t) 


Fig. 7.4. Construction detail in Poisson’s solution of the initial value problem. R is the region 
of the initial disturbance. 


stands for the mean value of g, over the sphere S(t). 

To furnish the other half of the initial condition we could specialize f 
in (7.49) to 8’ and pursue the consequences similarly to the above. 
However, it is quicker to use a trick that is often useful: if Ψ is a solution of 
a partial differential equation with constant coefficients, then so is any ¢ or 
x derivative. In this case, we note that 


W(x, 
x(x, ἢ) = : ; d 


is a solution of the wave equation, where y~(x,¢) is given by (7.50). 
Moreover, as t—0, we see from (7.51) that 


X=¥,247¥ (x), 
X= Vy = c°VAy—>0. 


To give x(x), X,—0 as t-0, we must now choose V(x) = go(x)/47, and 
take 


_af 
X(% 1) = ot | Anct “gro ἢ. 


The complete solution for general initial values therefore is 


] 
Anct Js(t) 


-. 
oe dt vod | Τ ποι oo 


=< (ctM[o]} +etM [oi]. (7.54) 
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Direct Verification of the Solution. 


It remains to verify directly that (7.50) satisfies the wave equation. We 
have immediately that 


7 27 
en 
Wy. . = et (Θ sin d0 dd 
7 0 “0 95; 


2 
=-lf τὰς 


where ἔξεχ - οἢ]. The ¢ derivatives require a little more manipulation. First, 


ὄπ 2π 
wate [ ‘ 12™ sin dO dd 
: 0 “Ὁ 9§ 


which reduces to 


ς οΨ 
wnt [ὅδ 
᾿ ἱ S(t) 0&? 


in view of the expression for ψ,. We see that 


Y= CM es 


as required. 

These arguments assume that Ψ is twice continuously differentiable. 
The solution (7.54) requires only that φρ and q, are integrable in order for 
it to be meaningful. We might extend the meaning of solution to include 
all cases in which (7.54) is defined. In particular, if gy and @, are piecewise 
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continuous, (7.54) is defined and y—>9,(x), 9, 9,(x) at points of continu- 
ity. 

For the balloon problem in Section 7.3, φρξξθ, 9,;=—P/pp in an 
initial sphere. This is an example of piecewise continuous data. It 1s 
interesting to construct the solution using Poisson’s integral not only for 
the spherically symmetric case but for an arbitrary shape of initial pressure 
region. This is left for the reader. 


Wavefront. 


If the nonzero values of (x) and ¢,(x) are confined to a finite region 
R as shown in Fig. 7.4, the solution at any point outside R is zero until the 
time when S(f) first intersects points of R. It is clear that this occurs when 
ct is equal to the least distance from x to the boundary of R. This least 
distance is on the normal from R to x. The wavefront at time ¢ can be 
determined by turning this argument around. Construct all the normals to 
the boundary surface of R. Measure a distance ct out along each normal. 
The surface formed by these points is the wavefront. Notice that where the 
surface of R is concave, the wavefront will be folded over itself after a 
while (see Fig. 7.5). This construction will be studied further in the 
discussion of geometrical optics. 


Fig. 7.5. Wavefront construction for a disturbance initially confined to the region R. 
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The disturbance at any point x outside R ceases when S(t) becomes 
so large that R is entirely within it. Thus in three dimensions an initial 
disturbance of finite extent produces disturbances which last only for a 
finite time. There is no “tail.” 


Two Dimensional Problem. 


The solution for a two dimensional distribution of initial values may 
be treated as the special case in which g(x) and »,(x) are independent of 
x3. Suppose nonzero values of 9 (x,,x>), @,(x;, x2) are confined to a finite 
region Ry of the (x,,x,) plane. From the three dimensional point of view, 
they occupy the cylinder R with generators parallel to the x, axis based on 
the cross section Ry. The initial disturbance is no longer finite in extent. 
For a point x outside the cylinder R, the wavefront construction is as 
before, but the spheres S(r) centered at x will intersect R at all times after 
the first time of intersection. This accounts for the “tail” in two dimen- 
sional waves, and it vividly shows the difference between two and three 
dimensions. 

In the solution (7.54), the value of (x,/) must be independent of x3. 
The integrals can be reduced to two dimensional form to show this 
explicitly. We consider the value of 


Ι 
= ——_ dS 
M | 90] Aarc?t? ss)" 
at a point (x,, x,,0). 
At a point (£,,&,&3) of S(t) (see Fig. 7.6), the value of @ is φρίξ;; 45). 


The outward normal has a direction cosine /, with the x, axis equal to 


ξ ᾿ γοῖ- αι -ἐ)π- α,-ε,} 


Οἱ cl 


The surface element dS is equal to dé,d&,/|/,| where dé,d&, is its projec- 
tion in the (x,,x,) plane. Therefore, remembering the two equal contribu- 
tions from above and below the plane, we may write 


1 If φρίξ! ξη)4ξ dé, 


Μ]|φ9]Ξ 
ee Ne Ree ey. 


3 


where a(f) is the interior of the projection of δ'({) onto the (x,,x,) plane: 


a(t): (x,-§&)° + (9-8) « εξ, 
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Fig. 7.6. Geometry involved in descending from three dimensions to two in the initial value 
problem. 


The full solution reduces to 


0.1 φρίξ;. ξ2) 4ξ᾽ ἀξ) 
60%) τ, {|| Ἐς --ς 
a(t) ct ~(x,-é,) — (x,-§) 


re (a= (7.55) 
a(t) 04? — (x,-£) — (2) -&) 


One notes the similarity with (7.29). 

Since the integration is over the whole of the inside of the circle 
(x,—&)’'+(x,—£)=c7t?, not just its boundary, the disturbance continues 
even after this circle completely surrounds the initial region Ro. 


7.7 Geometrical Optics 


In the discussion of one dimensional problems in Sections 5.5 and 5.6, 
the role of the characteristics as carriers of discontinuities was developed. 
It was also shown that the variation in the magnitude of the discontinuity 
could be found directly from the equations without finding the complete 
solution. The same is true in more dimensions, and the theory of discon- 
tinuities for linear equations is one version of geometrical optics. The 
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second version concerns periodic waves in the high frequency approxima- 
tion. The two cases are intimately related, since the Fourier analysis of 
discontinuous functions relates the singularities to the high frequency 
behavior. It turns out that both aspects of geometrical optics can be 
combined into the same derivation. 

Geometrical optics is particularly important when the exact solution 
cannot be found explicitly or is exceedingly complicated. Even for simpler 
problems it is often easier to find the wavefront behavior in this way 
rather than untangle it from the exact solution. We develop the ideas on 
the wave equation, then show the applications to waves in nonhomo- 
geneous media (for which exact solutions may not be obtainable) and to 
anisotropic waves (which are complicated). In the next chapter, an 
approximate theory for shockwave propagation is developed from the 
ideas of geometrical optics. Due to nonlinearity and number of di- 
mensions, such problems are extremely difficult to tackle in other ways. 

The main use of the discontinuity theory is to determine the behavior 
of a wavefront spreading into an undisturbed region. We suppose that the 
wavefront is specified by the equation S(x,t)=0, and that is identically 
zero in S(x,t)<0. The surface S=0 and the behavior of the discontinuity 
in g or its derivatives are to be determined. If the mth derivatives of are 
the first ones to be discontinuous at the wavefront, we assume that m may 
be expanded in the form 


gm gmtl 
PO Gai ae 


0, S<0 


Q= 


The coefficient ®,(x) determines the variation in the magnitude of the 
discontinuity. As we saw in the case of cylindrical waves, the singularity at 
the wavefront may involve fractional powers, so we allow m to be nonin- 
teger. The idea then is to substitute this series into the equation for q, 
equate the coefficients of successive powers of S to zero, and thus obtain 
equations for S and the ®,. In doing this, however, it turns out that all we 
require is that m should take the form 


p= Σ O,)/ (5) (7.56) 


where the f,(S) have the property 
In(S)=Sn—1CS). (7.57) 
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In calculating the derivatives of φ, derivatives of the f, will appear but they 
can all be replaced in terms of earlier members of the set by making use of 
(7.57). Then the equation may be satisfied by equating the coefficients of 
the successive f, to zero. This allows a number of important extensions, 
and the preceding case is included because 
nt+m 
| oy νὰ 
f,(S)= 4 (n+m)! (7.58) 
Lo, S<0, 
satisfies (7.57). 

We will carry out the substitution in the form (7.56), but for the 
present one should think of this as notation for (7.58). Then we will come 
back to consider the extensions. Also for the first run through we will 
consider m> 2. Strictly speaking, the meaning of “solution” is extended if 
the derivatives appearing in the equation are not continuous. For the wave 
equation this requires continuous second derivatives and m > 2. As we shall 
see, there is a real point to this, not just ultra caution! 

In the substitution, first and second derivatives of the f,(S) will 
appear, but they are replaced according to (7.57) by 


μ(5)Ξ... (5), fi (S)=Sp-2(S). 


For n=0, 1, these will introduce f_,(S) and f_,(.S) which do not appear in 
the original set. For (7.58) with m>2, they are defined by the same 
formula; in other cases their definitions will have to be included in 
defining the f,(S). After substitution in the wave equation, we have 


[s2—c-993] 9/-. 
τ ἰ [ S? _ cS? |®, +258, Do, +(S. - c~S,,)®o} a 


+ S Ef(S)=0. (7.59) 
n=0 


The expression for E, will not be needed beyond noting that it contains the 
leading term 


[s?- cS? |®,42 


followed by further terms in ®,, ,,...,®) and derivatives of S. 
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The wave equation will be satisfied if the coefficients of f_5,f_,,..., 
are individually zero. Thus 


5.2. ο-ἢς2- 0, (7.60) 


a® 
2S, ~— +(Syx,—¢77S,,)®y=0, (7.61) 


0x; us 


and the subsequent equations determine ®,,®.,,..., successively. The equa- 
tions for S and ®, are the ones of main interest. Before discussing their 
solution, we return to the question of extending the application by suitable 
choice of the f,. 


Discontinuities in ῳ or its First Derivatives. 


If m<2 in (7.58), the question of the definition of f_, and f_, arises, 
and this is bound up with the extended meaning of the solution. For m=2, 
that is, for discontinuities in second derivatives, the definition of f_, and 
f_, by (7.58) is still straightforward and the extended meaning of the 
solution is merely that the equation is satisfied separately on the two sides 
of the wavefront S=0. 

If φ itself is discontinuous, m=O, the series behind the wavefront is 


p= Op(x)+O,(x)S+ 5 O,(x)S? Ὁ τ5 τ, 
If this alone is substituted into the wave equation, the first two terms in 
(7.59) do not appear, and (7.60), (7.61) are lost. If, however, we take 
p= P(x)H(S)+O(x)H(S)+---, 
where H(S) is the Heaviside function 


l, S$>0, 


H(s)=| 
0, S<0O, 


and the H,(S) are its integrals defined by 


Ϊ 


H(S)=| nt 
0, S «0, 


Ss”, S>0, 


and if, in addition, the derivatives are taken in the generalized sense, we 
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have 
φ,Ξ ®)S,5(S)+8,S,H(S)+-°°, 


Pu = ®S,5 (5) x (Φ, 5. τ Φ,.5.,),)0 (5) pany 
and so on. Then (7.59) is obtained with 
fo=H(S), f_,=86(S), f_2=6(S), 


and the information on S and 9, is not lost. In the case m2>2, this 
difference does not arise. The explanation is that for m<2, we really are 
extending the discussion into the realm of “weak solutions,” and the 
extended definition does include information on the possible discontinui- 
ties, by whatever means the extension is carried out. For linear problems, 
the required extension is immediately obtained by allowing generalized 
functions such as the delta function and interpreting the derivatives in that 
sense. It is equivalent to the method indicated in Section 2.7. 

If φ is continuous, but first derivatives are discontinuous, the 
appropriate series would be 


p= O(x)H(S)+O,(x)H(S)+---, 
and in this case 


f(S)=H(S), f(S)=H(S), — f_(S)=A(S). 


Wavefront Expansion and Behavior at Large Distances. 


If (7.56) is viewed as an approximation for the behavior of the 
solution near the wavefront, rather than just as a device for studying the 
discontinuities in derivatives, the validity can be extended by allowing the 
f,(S) to be more general even than powers or step functions. For instance, 
in cylindrical waves, the wavefront expansion (7.34) takes the form (7.56) if 
we introduce 


(—1/2)! 1 Ss ai r 
(9) = ---------- 5 -- an, S=t--, 
SAS) τ Ἐπ J η(ηλ(8 -- η) ἢ , 
(n—1/2)! ες \itl/2 
®,(7)= n!(—n-—1/2)! Ee 
This expansion was found to be valid for 
2) 


2r 
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Thus S need not be small, provided r is large enough. The functions f,(S) 
satisfy the crucial relation (7.57) and so this expansion is included in the 
development here. 

We expect in general that the expansion (7.56) with appropriate f,(S) 
will give the behavior in some extended region behind the wavefront. Of 
course, the precise forms of these more general f,(S) can be known only 
from more complete solutions; they are not determined from the substitu- 
tion of (7.56) into the equation. But the determination of S and ®, from 
(7.60) and (7.61) still gives valuable information. Typically, this extension 
gives the behavior for large distances in the sense that cS/|x| is small. In 
the first approximation f,(S) gives the wave profile and ®,(x) gives the 
amplitude decay as x—> 00. 


High Frequencies. 


In wave propagation there is often interest in solutions periodic in 
time with a given frequency w. If the equation for q is taken more 
generally now in the form 


L, Ss Prt 


where L is some linear operator independent of ¢, periodic solutions may 
be written 

p= O(x)e~ 
where 


2 
LO+ 5 Φ-ο. 
ς 


For large values of w/c (normalized by some suitable length in the 
problem, possibly x itself), a standard method of finding the asymptotic 
solutions is to take 


Ze —n 
O~e”™  ᾧ (x)(—iw) , 
n=0 


where the functions o(x) and ®, (x) are to be determined. In terms of φ this 
15 


pr~e~#E-9OD)  ᾧ (x)(— iw)”. (7.62) 
n=0 


It may be rewritten as 


ὩΣ > ®,(x)f,(S), 
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with 


e i 
(iw) 
Moreover, with this definition of f,(S), (7.57) is satisfied. Therefore the 
equations for S and ®, are exactly the same as in the wavefront expansion; 
there is no need to rederive them, and we also see why results for the two 
cases are the same. 

In this application, the surfaces S=constant are surfaces of constant 
phase (e.g., crests and troughs), while ®,(x) determines the amplitude of 
the oscillations at x. 


S=t—o(x),  f,(S)= 


Determination of S and ®o. 


We continue now with the discussion of the equations for S and ®p in 
the case of the wave equation. The description will be given in the 
language of wavefront propagation, but the high frequency interpretation 
is obvious. 

Equation 7.60 for S is often called the eikonal equation. It defines how 
the surface S=0 moves in x space. The unit normal to the surface is given 
by the vector | with components 


= δὲ 
l; ΞΞ Vs] ᾿ 
The normal velocity can be calculated by noting that neighboring points 


(Xp,t)) and (x)+16s,t)+ δὴ are on the surface at neighboring times pro- 
vided 


S(Xpf9)=0,  S(xp+1ds,to+ dt) =0. 
Hence to first order in δὲ and 61, 


ἰδ. 65 + δ, dt =0, 


and the normal velocity is 


Bm ee Ts. NST (7.63) 


Thus the eikonal equation states simply that the wavefront has normal 
velocity +c. 
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In the further construction of the solution, it is convenient to specify 
the wavefront in the form 


S(x,t) =t—o(x)=0. (7.64) — 


The family of surfaces o(x)=constant gives the successive positions of the 
wavefront in x space. Equations 7.60 and 7.61 become 


Ι 
σῇ Ξε x (7.65) 


ὃ, 
2σ, Ox, ON, x; Py ΞΟ. (7.66) 


The nonlinear equation for σ may be solved according to the method 
of Section 2.13 by integration along its characteristic curves. If we intro- 
duce p;=o, and write the equation as 


] 


H=1¢p?- sc '=0, 


2 
the characteristics defined by (2.86) are curves in the x space with direction 


ἄχ 
τ 


Normalized in this way the parameter s is the distance along the 


characteristic, because reuse of the equation shows that c’7p?=1. The full 
set of characteristic equations (2.86)(2.88) is 


dx, dp; 


7 (7.67) 


; = CPi, 


a do _ 
ds Ἐν; ds 


Co | 


These can also be derived directly from (7.65), using 


ds Pi 9x, 4 Ox,’ 


without quoting the general formulas. Since the vector p, = σι, is normal to 
the wavefront o=constant, the first of the equations (7.67) shows that the 
rays are also normal; they are the orthogonal trajectories of the wavefronts 
o=constant. The second equation shows that p is constant on the ray; 


hence so is the ray direction cp, and the rays must be straight lines. The 
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rays can be constructed, then, by drawing the family of straight lines 
normal to the initial wavefront. The third equation in (7.67) may be 


integrated to 


0 


ll 
Ω [& 


where s is measured from the initial wavefront. At any time ¢>0, the 
wavefront t=o=s/c is a distance ct out along the rays. This is exactly the 
construction deduced for Poisson’s exact solution and represented in Fig. 
7.5. Formally, if xj is a point on the initial wavefront and I(x) is the unit 
normal at that point, the solution of (7.67) is 


K=XotH(x)s, ρΞο (0), =F. 


This is an implicit form for o(x); given x, the initial point xp and distance 5 
are determined, in principle, from the first of these, then o=s/c. 

These results are special to the wave equation. In general the rays, 
defined as the characteristics of the eikonal equation, are neither straight 
(nonhomogeneous medium) nor orthogonal to the wavefronts 
(anisotropic). 

Equation 7.66 is a linear equation for ®o, and its characteristics are 
the same rays already introduced for the eikonal equation. It may be 
written in characteristic form as 


1 4Φ, Ι 
ᾧ; ds = 5 OOx:x,? (7.68) 
the integration is straightforward, in principle, once o(x) has been deter- 
mined. It shows that ®, is to be obtained by integration along the rays and 
its variation is somehow related to the divergence of the rays as measured 
by σι, But, due to the implicit form for o(x), it is more illuminating to 
proceed a little differently. 
First we note that (7.66) takes the divergence form 


Fx (08) =0 (7.69) 
which suggests that something is conserved, and perhaps also suggests use 
of the divergence theorem. We consider a tube formed by rays, going from 
the initial wavefront 5): o=0 to the wavefront ὦ :o=¢ at time /, as shown 
in Fig. 7.7. We integrate (7.69) over the volume inside the ray tube and use 
the divergence theorem to write the result as 
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Fig. 7.7. Wavefronts and ray tube in geometrical optics. 


[ n,o,, B2dS =0, 


where n is the outward normal and the surface integral is over the sides > 
and ends 5g, 5, of the ray tube. In this case the rays are orthogonal to the 
wavefronts o=constant. Therefore 


and that contribution drops out. On 5 ,n and Vo are in the same direction 
50 


> 


n,0,,=|Vo| on ὦ, 


and from the eikonal equation (7.65), |Vo|=c~!. On So, n and Vo are in 
the opposite direction, so 


no, = --Ἰ  σ[Ξ -- τ on ὅὧ. 


In this case c is constant, and we have 


1: @2dS = dh Φ245. 
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If the ray tube is taken to be very narrow with small cross-sectional 
areas A@, on 5, and A@ on 5, this may be written to first order as 


Φι(Χ)Δ ᾧ = Φρί(χο)δ Gy, 


or, in the limit A@,, A@ 0, as 


®o(x) -( d@ . (7.70) 


It is usual to interpret (7.70) in terms of the flow of energy down the 
ray tube, particularly when the context is the high frequency approxima- 
tion for periodic waves noted in (7.62). There is an average flow of energy 
across any section of the ray tube, and without performing a detailed 
calculation it is clear that the flux is proportional to ®2 A@. Hence (7.70) is 
equivalent to a “law” of constant energy flux along the ray tube. For 
nonuniform media (as will be indicated below) there are extra factors, 
depending on the medium, multiplying ®2A@, but the law of constant 
energy flux remains true. It is in fact a general result of geometrical optics 
for nondispersive waves and it is often used directly to determine the 
amplitude variation without going through the detailed calculation each 
time. Recent work on dispersive waves has provided general arguments on 
this kind of question but has also changed the point of view. The more 
general concept appears to be conservation of “wave action,” which in the 
simplest linear cases is energy flux divided by an appropriate frequency. 
Here the frequency is constant and so the two are the same. These general 
questions are discussed in Part II. 

For plane, cylindrical, and spherical waves, the ray tubes are straight 
channels, wedges, and cones, respectively. Therefore we have 


da, =|, ®,= constant, 
d@ = 

d@o mie. Doar ue 
4 = R?, Φια καὶ =, 
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in these cases. They check with the earlier results, obtained from the exact 
solutions, for the behavior near the front or at large distances. In two 
dimensions, without cylindrical symmetry, 


A@y=R,A0, A@=(R,+5)d8, 


where R, is the radius of curvature of the initial wavefront, and ΔΘ is the 
angle subtended at the center of curvature (see Fig. 7.8). [The ray construc- 
tion shows that the radius of curvature of the wavefront at a distance 5s 
along the ray is (R, +5) and the angle subtended remains as Δθ.] Hence 


D)(x) -( Κι i. 
P (Xp) ) 


R,+s 


Fig. 7.8. Geometry of wavefronts and rays. 


In three dimensions, a little differential geometry shows that the surface 
elements are proportional to the Gaussian curvature: 


Δῶρα R,R,, A@x(R,+s)(R, +5), 


where R,,R, are the principle radii of curvature of the initial wavefront. 
Hence | 


D(x) -| RR, \ , 
@o(Xo) | (Ry +5)(R +5) 
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Caustics. 


At points where the initial wavefront is concave outward, the rays 
form an envelope as shown in Fig. 7.9. This is typically cusp shaped, and © 
between the two arms the region is triply covered by rays; it is like a fold 
in the sheet. The envelope is called a caustic. At the caustic, since 
neighboring rays touch each other there, d @ /d @ )—0. According to (7.70) 
this means that ®,->0o there. For the wavefront problem this result is 
correct and can also be established from the exact solution of the wave 
equation. Whether the wave equation still applies is another question. In 
acoustics, for example, it is obtained only after linearization; it would be 
invalid due to nonlinear effects when ®)—>0o. In Chapter 8 the nonlinear 
behavior of shocks is discussed, and it will be argued. (except possibly for 
extremely weak shocks which are affected more by viscous effects) that as 
a concave shock focuses it also speeds up and avoids overlapping itself. 


Fig. 7.9. Formation of a caustic. 


For high frequency waves, the geometrical optics approximation is 
invalid at the caustic, even as an approximation to the wave equation. The 
singular behavior of ®) makes the expansion (7.62) nonuniformly valid in 
the neighborhood of the caustic. The correct behavior was first investi- 
gated by Airy and more recent work has been carried out by Keller and his 
co-workers (see, e.g., Kay and Keller, 1954). The correct result is that the 
amplitude remains finite but large in w, typically being proportional to a 
fractional power of w. This topic is a little more special than the rest of this 
chapter and the reader is referred to the original papers. 


7.8 Nonhomogeneous Media 


In nonhomogeneous media the governing equations have coefficients 
which are functions of x. The expansion method still goes through, but the 
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equations for o and ®, are modified. In isotropic cases the only change to 
the eikonal equation is that c=c(x), so we can usefully discuss the 


consequences before looking at specific examples. The dependence on x 
modifies the characteristic form for the rays. With p,= o,, as before, and 


ast 2k -ὶ = 
H = 5 ¢(Xx)p; 5“ (x) =0, 


the characteristic equations are 


dx; _ 0H 

ds ὃ, ἜΣ 

ap; 0H OH Cx, 

ΠΣ, aS νὰ; 
do_, 0H 1 

ei ip; ς 


Since p is normal to the wavefront o=constant, the first equation shows 
that the rays are still orthogonal to the wavefronts. However, the second 
equation shows that cp, is no longer constant on a ray, and so the rays 
bend around in response to the gradient in c. The rays have to be found by 
solving the first two equations simultaneously for x,(s),p,(s), then the third 
equation gives 


Of course o is the time of travel of the wavefront along the ray. 

Fermat’s principle states that this time is stationary compared with 
neighboring paths between the two points. We can check this. Let an 
arbitrary path between the two points x=a and x=b be specified para- 
metrically by 


x =x( 1), O0<p<l, x(0) =a, x(1)=b. 


(To apply the usual methods of the calculus of variations, it is convenient 
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to normalize the parameter so the integration is over the same fixed range 
for all paths. Thus s is not a convenient parameter for this argument.) The 
time on each path is 


] 2 


oe nee [2] d (7.72) 
4 dup “" | 


From a standard argument in the calculus of variations, the x,() for a 
stationary value of any integral 


r= f F(x(wi(w)} ay, (=F, 


satisfies 


4(38). 5.0 
ἄμλ dx, OX; 


In our case, this reduces to 
dx. 
(iii TM), bye c, =0. 
du Cc [. 2 du ς : 
x; 


If, on this stationary path, we revert to s as parameter, by means of 


V2 du. = ds, we have 


d{1@\ 1. _ 
4: 7) + 6,70. 


This agrees with (7.71) and we have proved Fermat’s principle in this case. 
Fermat’s principle gives an immediate and illuminating picture of why the 
rays are straight when c is constant. 


Stratified Media. 


For a stratified medium in which c depends only on the vertical 
coordinate, y say, we can simplify the ray calculation further. First, any 
ray remains in the same vertical plane in which it starts. Thus it is 
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sufficient to discuss the two dimensional case with x horizontal and y 
vertical. The speed is c=c(y), and (7.71) reduce to 


ax _ Ca 
dy Pv Gq Px» 


Φι #2 “Ὁ), ὦ 


ΓΝ ds c2 : ds 


Since p, is constant and dx/ds=cp,, the angle @ of the ray to the 
horizontal is given by cos#=p,c(y), and if subscript zero refers to some 
initial point on the ray, we have 


_ cosdy ςοϑθ «(») 


μι,  οοβθὺυ 


(7.73) 
This is just Snell’s law in optics. 

The component p, can be found by solving the equations for y and p,, 
or better still by noting that p?+p3=1/c? (the eikonal equation itself) 
gives 


Ι cos’ 6, 


p=\/—— - 
c*(y) δ 


The ray equations may be combined into 


dx _ Pi _ c(y) cos 8y/ Co 


= : (7.74) 
oe ¥1—c?(y)cos?6,/c2 
Therefore the ray with initial angle 0) at (x,y,) is given by 
ἦς e(y)c0sO5/ co 
χΧ-- Χρξ ΞΞ. -ς-- ν - ΞῶΣ (7.75) 
yo V1—c?(y)cos?,/c2 
The time of arrival of the wavefront is 
(7.76) 


= [4-[ Z- f —_4#___. 
0 © Sng “P2 Syy c(y)Vl— cy) δον δ, c2 
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It should be noted that all we have really used in deriving these results 
is Snell’s law; we viewed it from the more general characteristic theory. 

Harmless appearing distributions for c(y) give some remarkable ef- 
fects for the rays. We note two examples. 


Ocean Waveguide. 


Suppose c(y) is as shown in Fig. 7.10 with variations in c confined to 
a layer |y|< Y, c=constant c, outside the layer, c<c, inside the layer, 
with a minimum c, at y=0. We consider the rays from a point source at 
x=y=9Q. 


Fig. 7.10. Rays in an ocean wave guide. 


As c increases along a ray, cos9@=ccos@)/cpy increases, 8 decreases; 
the ray bends toward the horizontal. If @)>cos~‘(co/c,), the ray 
penetrates into the region c=c, and remains straight thereafter. However, 
if 8, <cos~ '(cg/c,), cos@ increases to 1 and @ decreases to zero at the value 
of y for which | 


Co 


> COs Ay 


At this point the ray turns around, crosses the minimum in c again, and 
repeats the pattern symmetrically about the x axis. These rays, then, 
oscillate about the x axis as shown in Fig. 7.10. 
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The channel | y|< Y forms a sort of waveguide, and at points inside it 
sufficiently far from the source there may be a number of overlapping rays. 
Thus geometrical optics predicts a succession of signals. Moreover, it can 
be shown from (7.76) that the signals off the center line may arrive faster 
than those on the center line. They have farther to go but benefit from the 
faster propagation speed. In this situation the amplitude predictions of 
geometrical optics are not valid and one must turn to more exact treat- 
ments (see Cohen and Blum, 1971, for some recent work). 


Shadow Zones. 


For a source below a maximum of c, as shown in Fig. 7.11, one can 
similarly deduce that a shadow may be formed, into which the rays do not 
penetrate. The rays are sketched in Fig. 7.11. 


y 


shadow 


Fig. 7.11. Formation of a shadow zone. 


Energy Propagation. 


The modifications to (7.66) depend on the particular problem and on 
which particular physical quantity is denoted by φ. The remarkable thing 
is that the result is always constant energy flux down the ray tubes. We 
may verify this for acoustics in a nonuniform medium. To keep the 
analysis simple, we consider a fluid initially at rest with uniform pressure 
_ and no body forces, but with arbitrary density distribution p(x). We might 
imagine a heated layer of fluid in which the gravitational effects are of 
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smaller order and may be neglected. The linearized equations* for the 
perturbation pressure P, perturbation density R, and perturbation velocity 
V are 


R,+ V-(pV)=0, 
pV,+ VP=9), (7.77) 
P,—c7(R,+V:Vp)=0, 


where c?(x) is the sound speed. In terms of P alone, the equation is 


V 
P= 2 VP - “2.9 P| 


From the first two terms of the series 


P= > PaO {1-o(x)}, 


we find 


2 ] dP Px, 
σι = 2 ; oo + σιν, are Py =0. 


If V and R are expanded in similar series, the coefficients V, and R, may 
be determined in terms of P,, and o by going back to the original three first 
order equations in (7.77). In particular, 


P,Vo P 
Vo= ᾿ Ro= τ. (7.78) 


The equation for P,(x) may be written in divergence form as 
0 Poo, 
axl pf 
Integration over a narrow ray tube as in the argument preceding (7.70) 


*The notation is changed from Section 6.6 to avoid clashing here with the use of subscripts to 
denote the successive terms in the wavefront expansion. 
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gives 


Pe 
= A@ =constant. (7.79) 


The extra factor pc, which is a function of x, modifies the pressure 
amplitude in addition to the modifications of ray divergence. 

The interpretation of (7.79) as energy flux is best given in the high 
frequency application of geometrical optics. The average rate of working 
of the pressure on the fluid crossing the element of area A@ is 


PV, Ad, (7.80) 
where Κρ, is the component of V normal to A@. From (7.78), 
P P 
Von = on -Vo= = ὃ 
pc 


Hence 


P?2 
pc 


Thus (7.79) shows that the energy flux remains constant along the ray tube. 


7.9 Anisotropic Waves 


When there are preferred directions in the medium, the eikonal 
equation may not be symmetric in the σι. AS a consequence, the rays 
defined as the characteristics of the eikonal equation are no longer orthog- 
onal to the wavefront. If we assume that the medium is homogeneous, so 
that x does not appear explicitly in the eikonal equation, we may write the 
eikonal equation as 


A( P\,P2P3) =9, Pp; = 9,. 


The characteristic equations reduce in this case to 


ax; dp; do 
The p,; are constant on these rays; hence the ray direction H, is constant 
and the rays are straight lines. However, the ray direction is parallel to the 
wavefront normal if and only if 

Η, Xp;. 


P; 
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This is true if and only if H is a function of pj+p?+p3, that is, if the 
propagation 1s isotropic. 

A simple example of anisotropy is provided by the wave equation in a 
moving medium. If the medium has velocity U in the x, direction, and if ay 
is the propagation speed when the medium is at rest, 


2 
,...1[ ,,,..3. 
ἽΝ ala t Yas) & 


(The symbol c is reserved for the normal speed of the wavefront which is 
not dy.) The eikonal equation is 


o2 = 1. (1- Us, )’, (7.82) 


and we take 
] 2 = 2 
Η-- {ρἦ-- ας (1— Up,) ). 


The ray direction has components 


dn @ ape CaN Te Ὡς Ὁ 


This clearly is not in the direction of the wavefront normal p. For a point 
source the wavefront is just a sphere of radius apt swept downstream a 
distance Ut. The rays are shown in Fig. 7.12. 


Fig. 7.12. Wavefront and rays for an acoustic pulse in a wind. 
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The result that the rays are not orthogonal to the wavefronts is surely 
surprising at first encounter. The intuitive feeling that the wavefront is 
moving out along its normals is natural and one might expect as a 
consequence that the orthogonal trajectories play a basic role in the 
geometry. But it just is not so. The point is that the rays are concerned 
with energy propagation and neither the speed nor the direction need 
coincide with the normal velocity of the wavefront. This is the first 
appearance, in a restricted form, of the important difference between 
phase velocity and group velocity. It will be discussed in the general form 
in Chapter 11, and more detailed examination of the underlying concepts 
is postponed until then. 

We return to the general case in (7.81). Since the p, are constant, the 
integration of the equations is immediate. We consider the case of a point 
source at the origin. In terms of the distance s from the source, the solution 
of (7.81) is 

x,=1s,  o=p,ls, (7.83) 


i i 


where the unit ray vector 1 is defined by 


a (7.84) 


The family of rays is obtained by varying the p, over all values satisfying 


H(P},P2,P3) =0: (7.85) 


Each choice of the p, determines a ray, and at time ¢ the wavefront o=1 is 
a distance 


ee 
s= ἘΣ (7.86) 
along it. The coordinates of the point on the wavefront are 
i (7.87) 
Xp et . 
SB 


varying the parameters p,, p>, p3, subject to (7.85), gives the whole 
wavefront. 
From (7.63), with the reduced form δ᾽ = t— o(x) which is used here, the 
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normal velocity of the wavefront has magnitude 


= | = 
|Vo| 


Cc 


S| 


Therefore the unit normal to the wavefront is given by 
nN; = CP,, 
and the angle » between the ray vector and the normal vector is given by 


cos p= Un, = clp,. 


Therefore (7.86) may be written as 
.---Ξ- ᾿ (7.88) 
COS LL 

The wavefront moves along the ray with increased speed c/cosy so that its 
speed along the normal is c. 

It is sometimes convenient to use c=1/p and the unit normal n,;= cp, 
as parameters in place of p,, p>, Ρ95ῳ. Then the eikonal equation (7.85) 
determines c as a function of the direction n. This function c(n) specifies 
the anisotropic medium and the geometry of the rays and the wavefront 
can be expressed completely in terms of it. This description is particularly 
convenient for two dimensional and axisymmetric problems. We describe 
the two dimensional case in (x,,x,) space, but note that the axisymmetric 
case is exactly the same with x, interpreted as distance from the axis of 
symmetry. 


Two Dimensional or Axisymmetric Problems. 


If the normal ἢ makes an angle y with the x, axis, the eikonal 
equation provides the function c() for the propagation speed. To express 
the rays and wavefront in terms of c(wW), we shall need to find the direction 
of the ray vector ] in terms of c(). It turns out that it is most useful to do 
this by finding the angle μ᾽ between the ray vector | and the normal ἢ (see 
Fig. 7.13). Since 1 has the direction 0H / 0p and n has the same direction as 
p, this angle can be found from an argument in p space. The vector H, in p 
space is first written in terms of polar coordinates p and y. It has a 
component dH/dp in the direction of p and a component 0H/pdy 
perpendicular to p. Hence 


be jase! 0H / oy 
ae 0H /dp- 


(7.89) 
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Fig. 7.13. Geometry of wavefront, rays, and normals in an anisotropic medium. 


In terms of the function c(), an equivalent eikonal equation for H( PW) 
can be written: 


H =pc(w)-1=0. 


Therefore 
ο΄ 
ἴδῃ μΞΞ (ψ) Ἢ (7.90) 
c(y) 
This is the crucial expression relating the directions of 1 and n. 
The rays are given by 
X,;=scos(ut+y)t, x ,=ssin( pty). 
The wavefront (7.88) is at a distance 
Cc t 
s= (W) =t γε τ (7.91) 


COS ft 


along the ray with parameter y. Therefore in Cartesians the wavefront is 


_c(pit 


t 
COS pL ey Sas COS pL 


sin( wt+w). 


If these are expanded and μ eliminated from (7.90), we obtain 
x,= {ἐ(ψ) cosy—c'(p) siny } 4, (7.92) 
x2={¢e(p)sinyte'(y) cosp}t. (7.93) 
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The geometry is shown in Fig. 7.13. | 
Another derivation of (7.92) and (7.93) is obtained by arguing that the 
wavefront is the envelope of elementary plane waves 
x, cosy + x,siny=c(y)t. 
The envelope is found by solving this simultaneously with its y derivative: 


—x,siny+x,cosp=c’(y)t. 


The solution is (7.92), (7.93). This derivation is simpler, but it is limited to 
homogeneous media and it does not bring out the ray properties. We have 
preferred to unify all cases by one method, the method of characteristics 
applied to the eikonal equation. 


Source in a Moving Medium. 


To illustrate the results, let us apply them to (7.82). With p, =cosy/c, 
p,=siny/c, the eikonal equation gives 


Therefore 


c(y) = Ucosy+ ay 


for an outgoing wave. The wavefront (7.92)-(7.93) is 
x,=(U+ay,cosy)t, 
X= (aypsiny)t. 


This is a circle of radius apt centered at a point Ut downstream as required. 


Magnetogasdynamics. 


Again rather harmless appearing problems lead to surprisingly com- 
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plicated geometry. An interesting example of this arises in magnetogas- 
dynamics. In an infinitely conducting medium with uniform magnetic field 
in the x, direction, perturbations satisfy 


κ᾿ 
0? 
a —(a? τὴ) ὃς Vp + a*b* — ax? ΞΎ *p =0. (7.94) 
The eikonal equation is 
1 — (a?+ b*)p? + a*b*p? pp? =0. 
With p, =cosy/c, p=1/c, we have 
c4 — (a? + b?)c? + a*b? cos? =0. (7.95) 


There are two outgoing wavefronts (the fast and slow waves), correspond- 
ing to the increased order of (7.94). 

In this case, it is convenient to work with the polar form (7.91), where 
the wavefront is distance 


s=1VF4E (7.96) 
in the direction 
_ ec’) 
ξ(ψ) Ξμίψ) ἐψΨ, = tanu= ea (7.97) 


From (7.95), the derivative c’(W) satisfies 
{2c?— (a*+b*)c}c’—a*b*siny cosy =0. (7.98) 


Consider now the range 0<W<2/2 for the parameter y, and for def- 
initeness suppose a>b. 
From (7.95) to (7.98), we have the following values as y>0 and 


yor /2: 


y—0: 
coa, ῥ, 
c’—0, 0, 
pO, 0, 
£0, 0, 
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Yor /2 

co>Va?+b?, 0, 

c’0, a ; 
Vath? 

u—0, = a 3 

sae 0, 

soVa+ 1, - 9’ 

Var+b? 


The first solution gives points A,B on the x, and x, axes and suggests a 
distorted but reasonable outgoing wavefront 5, as shown in Fig. 7.14. The 
second solution is surprising. In both limits y>0, 7/2 we have 0 and 5 
finite. Thus we have points P,Q on the x axis. At P, ~=a/2 so the 
wavefront is tangential to the axis; at Q, ~=0 so the wavefront is 
perpendicular to the axis. Between y=0 and 7/2 there must be a maxi- 
mum or minimum of ξ. Since ἔξε μ-Ὲ Ψ, this occurs at the value of Ψψ given 
by 


Fig. 7.14. Wavefronts in magnetogasdynamics. 
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XO 


Fig. 7.15. Wavefronts in magnetogasdynamics. 


from tanu=c’/c, the condition may be written 


0" (y) + ely) =0. 


It may be shown that the wavefront has a cusp at this point. It may be 
shown also that ξ is negative. Thus the second front has the remarkable 
shape 5, shown in Fig. 7.14. Even though ~>0 and the wavefront is 
locally moving with a component in the positive x, direction, the energy 
propagation has a component in the negative x, direction, and as a 
consequence this wavefront appears below the x, axis. This is a striking 
example of the difference between the wavefront velocity and the ray 
velocity, between the phase velocity and the group velocity. 

The complete wavefronts are symmetrical in the x, and x, axes and 
the complete picture is presented in Fig. 7.15. 


CHAPTER 8 


Shock Dynamics 


The discussion of the wave equation has brought out most of the main 
ideas in the linear theory of hyperbolic waves in two or three dimensions, 
and we turn now to nonlinear effects. For plane waves in uniform media, it 
was possible to give a thorough treatment of the nonlinear theory. How- 
ever, it required sophisticated ideas and methods in contrast to the linear 
theory which was almost trivial. In more dimensions or in non- 
uniform media, where even the linear theory becomes complicated, we 
should expect considerable difficulties in the analysis. Cylindrical and 
spherical waves still involve only two independent variables, but some 
complication arises because their equations have nonconstant coefficients. 
Plane waves in nonuniform media are similar. In the general case of two or 
three dimensional propagation, we have to deal with more independent 
variables and the geometry becomes even more involved. It is not surpris- 
ing therefore that we have to resort to approximate methods. Indeed, the 
only exact analytic solutions are similarity solutions for special problems, 
and these usually require numerical integration of the reduced equations. 
The similarity solutions for cylindrical and spherical waves and for waves 
in nonuniform media have been discussed in Section 6.16; others will be 
referred to below. Apart from similarity solutions, one must use approxi- 
mate theories or numerical methods. This chapter and Chapter 9 are 
devoted to some of the approximate theories that have been developed for 
shock propagation in these circumstances. The description is for shock 
waves in gases, but the ideas and mathematical procedures may be used 
for analogous problems in other fields. 

The most obvious type of approximation is for those problems that 
can be treated as small perturbations to simpler problems with known 
solution. For example, a plane shock propagating through a slightly 
nonuniform medium or along a slightly corrugated wall can be analyzed as 
a perturbation of the uniform case. A problem like this will be analyzed in 
detail below, since it is needed in another connection, and others will be 
indicated. But the perturbation procedures of this type generally are 
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obvious, and the resulting mathematical problems, although often difficult, 
no longer involve new concepts about the behavior of waves. We again 
draw the line at developing purely mathematical methods and stress, 
rather, approximations that are intimately related with the wave propaga- 
tion. 

In an intuitive way, we can say that the difficulty in these problems is 
due to the combination of two effects: the shock is adjusting to changes in 
the geometry (or in the medium) at the same time that it is coping with a 
complicated nonlinear interaction with the flow behind it. Nonlinear plane 
waves are free of the first, and linear nonplanar waves are free of the 
second. In the more general case, if one of the effects can be dealt with 
fairly simply so that emphasis can be placed on the other, there is hope for 
an approximate theory. 

This chapter concerns problems where the nonlinear geometrical 
effects play the biggest role and the interactions with the flow behind are 
not responsible for the major changes in the shock motion. “Shock 
dynamics” seems to be a convenient name since the motion of the shock is 
stressed over the dynamics of the whole fluid flow. In the next chapter we 
consider problems where the opposite emphasis is appropriate. They are 
problems of weak shocks and the idea is that for weak shocks the 
geometrical effects, although important, can be taken unchanged from the 
linear theory. Then the nonlinear analysis consists of introducing, within 
that geometrical framework, the crucial effects of the nonlinear interaction 
with the flow. 

In both cases, the approximations become intuitive and are based on 
incorporating known effects into a mathematical description. The 
“Justification” comes from checks on particular cases that can be handled 
precisely and from comparison with observations. The problems are too 
hard for the more routine approximation procedures. 

In the discussion of shock dynamics, the description will be based on 
the picture provided by geometrical optics. There the geometry is in terms 
of wavefronts propagating down ray tubes and, for an isotropic medium, 
the rays are the orthogonal trajectories of the successive positions of the 
wavefront. For a shock moving into a gas at rest the medium is isotropic. 
Therefore, by analogy, we shall introduce rays orthogonal to the successive 
positions of the shock, and study how an element of the shock propagates 
down a ray tube. However, there is a crucial difference between a shock 
and a linear wavefront. The shock velocity at any point depends on its 
strength, so the geometry cannot be mapped out in advance, independently 
of the determination of the strength of the wave. The two are coupled 
together and the ray tube geometry itself has to be determined at the same 
time as the shock strength is determined in terms of the ray tube area. The 
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equations corresponding to (7.60) and (7.61) are coupled together. It is as if 
c in (7.60) depends on ᾧρ. But we have to analyze the whole thing again. 

As a building block in this theory, we shall need to study propagation 
down a given tube of arbitrary cross section. This is interesting for its own 
sake, and of course propagation in a wedge-shaped channel is identical 
with cylindrical waves, propagation in a cone is identical with spherical 
waves, so that we have an opportunity for further discussion of those 
problems. Propagation of a plane wave in a nonuniform medium is similar, 
and some details of that are also included. 


8.1 Shock Propagation Down a Nonuniform Tube 


We consider the one dimensional (hydraulic) formulation for flow in a 
tube of given cross-sectional area A(x). Even in a uniform tube, the shock 
can be changing in a complicated way due to interactions with the flow 
behind it, as described in the piston problem of Sections 6.8 and 6.11. But 
we are concerned with isolating as much as possible the effects due to a 
nonuniform A(x) and, effectively, want to take the very simplest piston 
problem. That is, we want to formulate the problem in such a way that the 
shock would continue with constant speed in the case A(x)=constant. To 
do this, assume that 


A(x) =A,=constant in x <0, 


and that the shock is initially moving in this section with constant Mach 
number M,. We may imagine the shock to be produced by a piston 
moving with appropriate constant speed far back in the uniform section. 
The piston is still providing the thrust to keep the shock moving, but there 
are no changes due to this; the changes are due entirely to the cross- 
sectional area. The problem then is to determine how the Mach number of 
the transmitted shock depends on A(x) in x >0. 

The flow is not strictly one dimensional but if the cross section A(x) 
does not vary too rapidly, the equations obtained by averaging across the 
tube will provide a good approximation. They are 


A(x) 
ὁ πρῶτες =(), (8.1) 
u,+ uu, + τ, =0, (8.2) 


DP, + up,,— a*(p, + up,) =0. (8.3) 
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The area change appears only in the continuity equation (8.1), and this 
equation follows immediately from conservation of mass in the form 


(pA),+ (puA),=0. | (8.4) 


We note that for propagation into a wedge with apex at Xp, 


Α΄ (χ 24 
A(x)&(%-x), “ a rt (8.5) 
and for a cone 
A(x) (x 9—x)’, aX) Seales (8.6) 


A(x) 7 (X9- x) 


With r=(x )— x) and the sign of u reversed (to be measured positive for 
increasing r), the equations are then identical with those for cylindrical and 
spherical waves in (6.132)}(6.134), and they are exact. The fact that the 
equations are exact in this case indicates that the true criterion for the one 
dimensional formulation is really that the curvature of the walls in the x 
direction should be small. But the question of precise validity seems to be 
one which has never been completely investigated. 

Figure 8.1 is the (x,7) diagram for the problem, with the origin of ¢ 
taken as the time when the incident shock arrives at x =0. For t<0 the 


Py »P} Uy 


Fig. 8.1. The (x,#) diagram for a shock entering a nonuniform region. 
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flow consists of uniform regions separated by the moving shock. We take 
u=(, p= Po, P= Pp in the undisturbed state ahead of the shock, and we take 
uU=uU,, P=P,, P=p, in the initial uniform state behind it. The quantities 
U,,P,,p, are determined in terms of po, po, My by the shock conditions. 
When the shock reaches x =0, disturbances to this state propagate on the 
particle paths P and the negative characteristics C_. The C_ may have 
positive or negative slope depending on whether u,>a, or u,<a,; the 
latter case is shown in Fig. 8.1. The problem is to determine these 
disturbances from (8.1)+(8.3), together with the modifications of the shock 
position and strength. The shock conditions are 


2 ] 
μπα τ {Μ-- a7), (8.7) 
P=poae Peer) eae aad (8.8) 
PL tl y(yt1) J 
(y+1)M? 
= 8.9 
ee ey ee (8) 


and, when required, the sound speed is given by a”= yp/p. 


The Small Perturbation Case. 


The one dimensional formulation is not limited to small changes in 
A(x) itself, since large changes may be attained over large enough dis- 
tances, even though the derivatives of A(x) are small. However, in the case 
when the change of A(x) from A, remains small, 


0 

we may assume that the disturbances to the state u,,p,,p, behind the shock 
and the change in shock Mach number are correspondingly small. We may 
then solve the problem as a perturbation on the solution for the uniform 
tube. The equations (8.1)-(8.3) and the shock conditions (8.7)(8.9) are 
linearized about the state u,,p,,p,. However, it should be noted that there 
is no assumption that p,—ppo,..., are small; the shock is of arbitrary 
strength. 
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The linearized equations are 


ριμγ4΄ (χ) es 


0, + Up, + pu, + A συ, 
0 


Ι 
u, + μι, + ee (8.10) 


Py t Uy Py — at(p, + U1P,) =Q, 


where, to save writing, we leave it to be understood that p, is interpreted as 
(9 —p,),, A(x) as {A(x)— 40}΄, and so on. The general solution is readily 
obtained since the equations are linear with constant coefficients; the most 
significant derivation is via the characteristic form of the equations. The 
characteristic equations for (8.10) are 


[8 ὃ , A(x) 
Cy! ay + (uit a1) 5. {δι 91") ak yee (8.11) 


ὃ A'(x) 
cf τί a) | (peru) + atu, a =0, (8.12) 


0 0 pe Ses 
P: = + uy 3 ~ Ἰὼ ajp)=0, (8.13) 
and the general solution taking each in turn is 


ριαΐμ A(x) —Ag 


μ τ a, Ao 


(p—P,)— aj(o— p,) = H(x— 4,0), (8.16) 


(p—P\)—p\a,(u—u,)= — ἘΠ {Χ8Ξ (μι, -- a,)t}, (8.15) 


where F,G, and ἢ are arbitrary functions. In the linearized form, because 
of the constant coefficients, we have been able to carry out the integration 
on the three families of characteristics explicitly, and the characteristics 
have been approximated by the straight lines, x —(u,+a,)¢=constant, 
x — u,t=constant. The three arbitrary functions are to be determined from 
the initial conditions of the problem and the boundary conditions at the 
shock. 
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First, and most decisive, F must be identically zero. This is because 
the C, characteristics behind the shock, that is, the lines x —(u,+a,)¢<0, 
all originate in the uniform region where u=u,,p=p,,p=p,,A Ξ 4, (see 
Fig. 8.1); hence from (8.14), F=0. It is in this crucial step that modifying 
disturbances overtaking the shock are excluded. It should also be stressed 
that in this perturbation analysis the conclusion F=0 15 a strict deduction 
from the formulation of the initial conditions, not an intuitive argument. 


The other two functions G and AH are not zero: they describe the 
disturbances on the C_ characteristics and the particle paths P shown in 
Fig. 8.1. These originate at the perturbed shock and the three shock 
conditions are sufficient to determine G,H, and the change in shock Mach 
number (from which the change in shock position may also be deduced). 
The functions G and H are of subsidiary interest. The main result we want 
is the change in shock Mach number, and this can be determined without 
involving G and H. The shock conditions give the perturbations p — p,, 
u—u,, at the shock in terms of the change in Mach number M— M). From 
(8.7) and (8.8), they are 


4p a5 2 
ἘΠ Μι(Μ- Μὴ, κι τΞ 


Ρ-ριΞ 


2 
Ω͂ a u A-A 
απ Yeap © erent aS yen 1 pa re 0 
yt1 y+ M3 | P0% 


(8.18) 


The expressions for Μ᾽» p,, a, in terms of M, are given by (8.7)-(8.9) with 
ΜΞ Μ9. Then, after some algebraic manipulation, (8.18) becomes 
Ag 


= —8(M)(M—M)), (8.19) 


where 


M 2 1-p’ [ I 
M)= 1+ —— 1+2u+—}, (8.20 
2 
>. (y-1)M'+2 
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The quantity p is in fact the Mach number of the shock relative to the flow 
behind it. 

When needed, the expressions for G and H may be found by applying 
(8.15) and (8.16) at the shock; in the small perturbation theory, it 1s 
consistent to apply these conditions at the unperturbed shock position 
x=a Mot, since the errors would be second order. The details may be 
found in the paper by Chester (1954), where small perturbation results 
were first derived. It is interesting that the area term in (8.15) changes sign 
at u,=a,, and in fact is singular at u,=a,. Yet (8.19) shows neither a 
change in sign nor any singularity. Friedman (1960) has investigated the 
case u,=4a,, which is the case of exactly sonic flow behind the shock. He 
shows that small nonlinear effects must be incorporated into the distur- 
bances on the C_ in order to obtain a uniformly valid solution, but that 
(8.19) is unaltered. 


Before discussing (8.19) in detail, we go on to an important extension. 


The Finite Area Changes; The Characteristic Rule. 


For a tube which varies slowly but which accumulates large changes 
in A(x) over a sufficiently large length, we might break down the problem 
into successive small lengths in each of which the change in A is small. In 
each such small length of tube, it would be admissible to linearize about 
the local conditions and develop a small perturbation theory as in (8.14)— 
(8.16). But it would no longer be strictly valid to take F=0, because the 
entry conditions into each of these subsections would not be a uniform 
state. After a number of successive sections the errors might accumulate. 
However, if we neglected this, (8.19) would apply to each subsection with 
A, and M, taken to be the area and Mach number at the entry to the 
subsection. But then the theory is extremely simple. We are saying, in 
effect, that (8.19) is the differential form of a functional relation M 
= M(A): 


ap ee) (8.22) 


and we do not have to discuss the subdivision into small subsections 
explicitly at all! Moreover, (8.19) was itself merely a substitution of the 
shock conditions into the characteristic relation on the C., characteristics. 
So the whole derivation can be put into the following characteristic rule: 


Write down the exact nonlinear differential relation for the C, 
characteristics. Substitute the expressions for p, p, u, a in terms of M from the 
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shock conditions. The resulting differential equation gives the variation of M 
wih x. 


Although we have all the ingredients already, let us follow this 
prescription to emphasize its simplicity. The basic equations governing this 
particular problem are (8.1)-(8.3). The characteristic equation for the C, 
characteristic is 

dp du , pau 1 dA 


Bade = utaA dx 


=(. (8.23) 
The shock conditions are given in (8.7)-(8.9). On substitution we have 
g(M)——+——— =0, (8.24) 


where g(M) is given by (8.20). It is convenient to write this as 


1dA_. 
a ACM hor ἘΞ (8.25) 
where 
7 2 Te 1 
λα [τες 2 .-Ξ Mesedil (8.26) 
—1)M*+2 
jE (8.27) 
2yM*—(y—1) 


The reason for this choice is that A(M) varies little over the range of Mach 
numbers. The limits are 


Mol, d>4, (8.28) 


dy 


Tz 75.0743 for y= 1.4. (8.29) 


Μ-»οο, Aon=l+=+ 


Formula 8.25 was first obtained by Chisnell (1957), who used a 
different approach. The area distribution A(x) was approximated by a 
sequence of discontinuous steps and the solution was built up from 
analysis of the elementary interaction of the shock at each discontinuity. 
Αἴ each interaction the shock is transmitted with a modified strength, and 
disturbances are reflected (the C_ and P disturbances in our analysis). But 
the reflected disturbances are themselves re-reflected when they travel 
back through the earlier steps; the re-reflected waves overtake the shock 
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and contribute to the later interaction. If all re-reflected waves are neg- 
lected, (8.25) follows. Chisnell analyzed the effects of all singly re-reflected 
disturbances and found that their total modification to (8.25) was much 
smaller than their individual contributions. Earlier Moeckel (1952) had 
applied similar ideas to steady oblique shocks in nonuniform supersonic 
streams. The nonuniform stream was replaced by layers, in each of which 
the flow quantities were constant, with surfaces of discontinuity between 
the layers. The solution was built up from the elementary interactions at 
the interfaces. 

Although the Moeckel-Chisnell approach offers in principle the 
possibility of successive improvement by including more and more 
multireflections, it does not seem practicable to go beyond the first 
re-reflections. It is also difficult to assess to what degree of approximation 
the equations (8.1)-(8.3) have been solved. However, the relatively small 
modifications from the first re-reflections indicate that (8.25) may be 
unexpectedly good. This is indeed the case, as we shall see below. 

When the quick derivation of (8.25) by the characteristic rule occurred 
to me, I hoped also that a full analysis of the approximation could be 
based directly on (8.1)-(8.3). So far this has not been completed! To see 
what is involved, note that the characteristic equation (8.23) may be 
written 


P; 
uta 


+u,] pe ag (8.30) 


+p, + 7 = 


t 
uta 


This is exact and holds throughout the flow since it is just a combination 
of the basic equations (8.1)-(8.3). If (8.23) is applied to a shock moving 
with velocity U, we are claiming that 


pa*u Α΄ (χῚ - 
uta A(x) i 


Ρι 


Uu, | : 
ve +p. tpa( τε τας} + 0 (8.31) 


is a good approximation at the shock. Taking (8.30) and (8.31) together we 
see that the approximation is based on the assumption that 


Ι Ι 
— 32 
( i. ea Ἰώ" pau) (8.32) 
is relatively small at the shock; that is, this expression is small compared 
with p,/U. The smallness of the first factor would correspond to the 
thought that the C, characteristic in Fig. 8.1 is fairly close to the shock, so 
that we are merely transferring the relation that holds on a C, to the 
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shock. However, whereas (u+a—U)/U is zero for M=1, it tends to 0.274 
(for y=1.4) as M-—>oo. The consequences of the characteristic rule are 
sometimes a hundred times more accurate than this! In the case of a 
cylindrical or spherical implosion noted below the relative error is about 
0.003. Thus although the first factor may contribute a little to the accuracy, 
the rule works well because 


P, + pau, 


8.33 
rs (8.33) 


is extremely small at the shock. Although some further discussion was 
presented in the original paper (Whitham, 1958), no really satisfactory 
explanation of this was found. Of course we know that the result is correct 
in the small perturbation theory, and we check from (8.14) with F=0 that 


P, + p\4,u, =9, 


in that theory. 

With only the partial justification provided by the small perturbation 
case and the Moeckel-Chisnell analysis, the accuracy of (8.25) has been 
confirmed by comparison with known solutions. First for weak shocks, 
M= 1, we have \=4; hence 


M-1a“A7'/2, (8.34) 


This is the correct result of geometrical acoustics for weak pulses, as M— 1 
is proportional to the strength of the pulse. Secondly, we may apply (8.25) 
to converging cylindrical or spherical shocks by taking A&x g— x or 
(χο- x)’, respectively, and compare the results with Guderley’s exact 
similarity solutions described in Section 6.16. For infinitely strong shocks, 
λ tends to the value n given in (8.29), and (8.25) becomes 


πὰΜ 1 44. 7, yaar, (8.35) 


Therefore the rule gives 


Μααν". ἰἴοζγ cylindrical shocks, 
(8.36) 


Mar-?/" for spherical shocks. 


A comparison with the exponents from the exact similarity solution is given 
in Table 8.1. The accuracy is amazing in view of the simplicity of the 
approximate theory. Among other things, it shows that converging shocks 
are reacting primarily to the changing geometry as assumed in the 
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approximate theory, and are very little affected by the further disturbances 
from the source of the motion; the strength of the initial shock enters only 
through the constants of proportionality in (8.36). This would not be true 
for outgoing shocks. They would slow down due to the expanding 
geometry and the continuing interaction with the flow behind over large 
distances would be important; this approximate theory is not appropriate 
for such problems. 


TABLE 8.1 
Cylindrical Spherical 
Ύ Approximate Exact Approximate Exact 
6/5 0.163112 0.161220 0.326223 0.320752 
77,5 0.197070 0.197294 0.394142 0.394364 
5/3 0.225425 0.226054 0.450850 0.452692 


Another point of interest is that according to the approximate theory 
the exponent for the spherical case is just double the exponent in the 
cylindrical case. This is not true, however, in the exact similarity solution, 
although it is very nearly so. 

With the partial justification mentioned earlier and these independent 
checks, we conclude that the characteristic rule gives a good simple 
approximation to problems of this type and it may be used with confidence 
in a wide range of problems. 

For general M, the solution of (8.25) may be written 


A f(M) 


ΒΩ aa H(t) =er0| - ὙΠ 
0 


f( Mo)’ M*—| 


dM | (8.37) 
In particular this formula may be used to extend the results for converging 
and spherical shocks to include shocks of intermediate strength. Of course, 
as the center is approached, A—>0 and M->oo. Table 8.3 presents values of 
f(M) for y=1.4. (See p. 288.) 

In the next section the characteristic rule will be used for the problem 
of a shock propagating through a nonuniform density layer, and further 
examples may be found in the original paper (Whitham, 1958). It will also 
be the basis for the geometrical treatment of two and three dimensional 
shock propagation in Section 8.3. 
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8.2 Shock Propagation through a Stratified Layer 

Here the method is applied to the one dimensional problem of a plane 
shock moving in the x direction through a given equilibrium distribution 
u=0, p=Ppo(x), p=po(x). If po(x) is not constant, there must be a body 


force in the problem to maintain the pressure gradients and we include this 
in the equations. The one dimensional equations are 


p, + up, + pu, =0, 
l EF 
u,+ uu, + Fs oF (8.38) 
p, + up, — a*(p, + up,) =0, 


where % is the body force per unit mass. In the atmosphere or for 
propagation in the outer layers of a star, & would be the gravitational 
acceleration. In equilibrium, the distribution of p (x) and p,(x) must 
satisfy 


“9. αὶ (8.39) 


and the entropy distribution must be given to complete the determination 
of Po(x),Po(x). In the atmosphere %& = — g and we have, for example, 


Po(x) = po(O)e~ 8/270 (isothermal), 


= pg '(x) =ce—gx (isentropic), 


as discussed in Section 6.6. 

We now apply the characteristic rule to the propagation of a shock 
through such a layer, remembering that the theory applies only for the 
local effects of the layer and should be used only when additional effects 
would be small. The appropriate characteristic relation may be written in 
differential form as 


§ dx =0 on 


ies Ξε 
ἐδ oe aaa dt 


uta. 
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But we apply it along the shock. That is, we use the differential equation 


dp du pa π᾿ 
dx * PF ay wea tS Ce 


with u,p,p,a expressed in terms of p,(x), P(x), and the shock Mach 
number M(x). In general, numerical integration will be required, but the 
result for strong shocks may be obtained analytically. For strong shocks 
[see (6.110)], the shock conditions simplify to 


2 y+] y 2γ(γ--Ἰ) 
U, p= pall: ΞΞ----- - θὴ; C= 2 - ΄΄΄ “2 
y+1 y+1"° y-1°° ρ (y+1) 


where U is the shock velocity. In this limit U is relatively large and the 
third term in (8.40) is negligibly small compared with the other two; the 
body force & enters indirectly through its control of Po(x). Equation 8.40 
reduces to 


where 


p=(2+ -“:- (8.41) 


Hence 
Uxpy”, pxp, 38, (8.42) 


For y= 1.4, B=0.21525. 

These results allow a further check to be made against exact solutions. 
Sakurai (1960) investigated similarity solutions to this problem in the case 
where pypxx°*. He found Uxx~* and determined the value of A for 
different values of a. His values of A/a are given in Table 8.2, and they are 
compared with £. Although not quite as good as for the implosion 
problem, the approximation is still remarkably close. 

The limitation to problems where Jocal modification of the shock is 
intense must be borne in mind. For an exponential fall off in density, 
similarity solutions can also be found and compared with this approxima- 
tion. The comparison has been made by Hayes (1968) and the difference in 
the exponents is as much as 15%. We attribute this to the fact that an 
exponential change in density does not have the strong local effect of a 
power law with p,)->0 at finite x. 
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TABLE 8.2 
Ύ a=2 a=1 a=1/2 B 
5/3 0.21779 0.22335 0.22820 0.23608 
7/5 0.19667 0.20214 0.20704 0.21525 
6/5 0.16545 0.17040 0.17498 0.18301 


Chisnell (1955) originally studied the problem of this section from the 
successive interaction approach, in the case p)=constant, ὅ =0, and found 
the small corrections due to re-reflections. As before there is some benefi- 
cial cancellation. 


8.3 Geometrical Shock Dynamics 


We now turn to the development of the approximate geometrical 
theory for shock propagation in two or three dimensional problems when 
there is no special symmetry (Whitham, 1957, 1959b). We consider a shock 
propagating into a uniform gas at rest and, based on the experience with 
geometrical optics for linear problems, we introduce “rays” defined as the 
orthogonal trajectories of the successive positions of the shock. As a 
specific example, consider the case of shock diffraction around a corner in 
Fig. 8.2. The shock positions are shown by full line curves and the rays by 
broken lines. The idea is to treat the propagation of each element of the 
shock down each elementary ray tube as a problem of shock propagation 
in a tube with solid walls. The equivalence would be exactly valid if the 
rays were particle paths, since solid walls are particle paths in inviscid flow. 
However, this is only approximately true. The shock conditions require the 
induced flow immediately behind the shock to be normal to it, but as the 
distance from the shock increases the particle paths will deviate from the 
rays in general. So a definite approximation is involved, and it is one that 
may be quite severe. However, it is only by this step, or a similar one, that 
the geometrical effects can be extricated from the whole complicated flow. 
In the problem of diffraction around a corner (Fig. 8.2) the wall itself has 
to be both a ray and a particle path along its entire length, so there is some 
additional resistraint on the deviation between the rays and particle paths 
further back. 

The accuracy of this type of approximatiom is hard to assess in 
advance and higher approximations are virtually impossible. As justifica- 
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Fig. 8.2. Shock positions (solid lines) and rays (dashed lines) in the diffraction of a shock 
around a continuous corner. 


tion we shall see that the theory does reduce precisely to geometrical optics 
for linear problems and the nonlinear results will be compared both with 
other theoretical results for special cases and with experiments. It is 
perhaps appropriate to comment that approximations which are easy to 
assess usually involve small effects. Here we are concerned with large 
effects in extremely difficult problems. 

The ray tube approximation is independent of how the propagation in 
each ray tube is handled. However, we assume that the local Mach number 
will be a function of the ray tube area, and in the absence of any other 
explicit formula we adopt the results established in Section 8.1 and use the 
relation in (8.37). 
| It is convenient to specify the shock position at time ¢ in the form 


a(x) = dol, (8.43) 


where ay is the undisturbed sound speed. The successive shock positions 
are then given by the family of surfaces a(x)=constant. It is clear that in 
principle we have a procedure to determine the function a(x). First of all, 
the shock Mach number at any point can be determined in terms of a(x) 
from (8.43). Secondly, it must be possible to determine all the geometry of 
the rays from the function a(x), since it specifies the family of shock 
positions: this determination gives the ray tube area. The A-M relation 
then provides the bridge to derive an equation for a(x). 

The normal velocity of any moving surface S(¢,x)=0 has been noted 
in (7.63). If this is applied to S=agt— a(x), the shock velocity is found to 
be U=a)/|Va|. Therefore 


1 
M= Wal’ (8.44) 


Sec 8.3 GEOMETRICAL SHOCK DYNAMICS 279 


To study the geometry of the ray tubes it is convenient to introduce a unit 
vector 1 for the ray direction at any point and a function A related to the 
ray tube area. The definition of | is clear and it is given in terms of a by 
Va 

l= Wal’ (8.45) 
since the rays are normal to the surfaces a=constant. The definition of A 
may need a little amplification. We want to introduce a finite function of 
position that can be used to measure the area of arbitrary infinitesimal ray 
tubes. To do this we consider any particular ray and construct a narrow 
ray tube around it consisting of a bundle of neighboring rays. We may 
then introduce the ratio of the cross-sectional area at any location along 
the ray tube to the area at a standard reference section. In the limit as the 
maximum diameter of the ray tube tends to zero, this ratio approaches a 
finite limit and the limit function is taken as the function A along that ray. 
It is defined in similar fashion along each ray and so becomes a function 
of position. For any infinitesimal ray tube, A is now proportional to the 
ray tube area rather than being the area itself. However, in (8.37) only the 
ratio of areas appears so that this quantity A is still related to the local 
Mach number by 


= ; (8.46) 


For the same reason the original reference point for the ratio of cross- 
sectional areas along the ray tube drops out and is replaced by the initial 
condition A =A, for M=M,, which is incorporated in (8.46). In fact, A 
can be taken to be any finite function proportional to the infinitesimal ray 
tube area along the ray. Different constants of proportionality on different 
rays would be compensated for by different values of Ao. 

- The relation of A to the function a(x) defining the shock positions 
comes essentially from the fact that increases in A along a ray are related 
to the divergence of the ray vector | in (8.45). In fact, we now show that 


] 
.Φ —— = . 7 
V (4 ) 0, (8.47) 
and this can also be written 
1 dA 1-VA 
ὡς — ee iE Oo 41, 48 
A ads A ve (8 ) 


To prove (8.47), the divergence theorem is applied to the volume V in a 
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Fig. 8.3. Ray tube geometry. 


narrow ray tube between the two successive shock positions as in Fig. 8.3. 
We have 


] ἐν 
v-(4 Jav= [ bv 45, 8.49 
I, A S,;+E+5, 4 ie 


where = refers to the sides of the tube, S, and S, refer to the ends, and ν is 
the outward normal. On the side walls 1-y=0 by definition of 1 so the 
contribution from = is zero. On S,, l-y= +1, while on S,, l-y= —1. Hence 
the right hand side of (8.49) reduces to 


[ἄς [Ὁ 
s, A s, A 


From the definition of A, both integrals tend to the same value so the 
difference tends to zero as the diameter of the tube shrinks to zero. 
Therefore the integral in (8.49) is zero. Since the choice of V is arbitrary, it 
follows that (8.47) holds everywhere. 

Equations 8.44-8.47 provide a partial differential equation for a(x). 
Collecting the results we have 


Marea: (8.50) 
v-(fva)=0, (8.51) 

A ΚΑ) 

Ἂς ΚΜρ᾿ ee 


where l= Va/|Va|= © Va has been used in obtaining (8.51) from (8.47). 
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This is a convenient form for comparison with the results of geometri- 
cal optics in linear theory. The linear limit corresponds to M->1 and the 
linear theory replaces (8.50)-(8.52), respectively, by 


|\Va|=1, (8.53) 
v-(4va)=0, (8.54) 
7 8} (8.55) 


Note that Μ is replaced identically by unity in the first two, where the 
geometry is concerned, but M—1 appears as a measure of the strength 
(which is small) in (8.55). In this way the geometry is uncoupled from the 
determination of the strength of the wave. The flow quantities such as 
z=(p—Pp)/Po ate proportional to (M— 1) and the linear theory would 
refer to z rather than M—1. From (8.54) and (8.55) we have 


V-(z2Va) =0. (8.56) 


Equation 8.53 is the same as the eikonal equation (7.65), bearing in mind 
the normalization of a by the sound speed, and (8.56) is the same as the 
transport equation (7.66) with Ζα ᾧ,. In the linear theory (8.56) was 
obtained first and then interpreted as zx A~'/?, which corresponds to 
(8.55). Our arguments here have led directly to what was the 
“interpretation.” The main point, however, 1s that the theory developed 
here does reduce to the linear theory in the appropriate limits. The crucial 
difference in the nonlinear theory is the coupling of the strength z with M. 
Even for weak shocks, with M—1<1, this coupling can make important 
qualitative differences. 


8.4 Two Dimensional Problems 


In two dimensions the shock positions and rays form an orthogonal 
coordinate system as shown in Fig. 8.4, and for some purposes it 15 
convenient to formulate the equations in terms of these intrinsic 
coordinates. The successive shock positions are already described by the 
family of curves a=constant, and we introduce a function B(x) to describe 
the rays as the family B=constant. The required equations, using a and B 
as independent coordinates, can be derived by direct transformation of 
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equations (8.50) to (8.52), but it is instructive in bringing out further points 
in the geometry to give an independent derivation. (This was, in fact, the 
first derivation of the theory.) 


Fig. 8.4. Line elements in shock dynamics. 


In the description based on the net of curves a=constant, B 
=constant, the geometry is closely tied to the line elements for increments 
da,dB in the coordinates, and the ray tube geometry is introduced via the 
coefficients for the line elements. The line element for an increment 48 will 
be A(a,f)dB for some function A. This function A is clearly proportional 
to the width of the ray channel between rays B and β- α4β. In the two 
dimensional problem ray tubes are of constant depth in the third dimen- 
sion; hence A is proportional to the ray tube area and may be used as in 
the first formulation. An increment da corresponds to a change of shock 
position in time dt=da/a,. Therefore the distance traveled is Udt= M da. 
This shows that the line element for the increment da is Mda. The general 
line element for neighboring points is given by the metric 


‘ds? = M*da? + A7dp?. (8.57) 


The functions M and A in such orthogonal coordinates are not arbitrary 
functions of (a,8). They satisfy a differential equation which follows 
because we know that the 2-space described by (8.57) is, in fact, flat. The 
curvature calculated in terms of M and A must vanish. Another way of 
saying this is that M and A must be such that (8.57) can be transformed 
into 


ds? = dx* + dy’. 
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The appropriate condition, which will be derived below, is 


d {1 0A d{1 0M 
Δ (E4)+ δ. [1 8Μ-Ὁ (8.58) 
When the A-M relation is added we have a complete set of equations to 
determine A(a,8), M(a,8). From these the rays and shock positions can 
be determined as functions of x and y. 

To establish (8.58), consider the curvilinear quadrilateral PQRS in 
Fig. 8.4 with vertices (a,8), (a+ 6a,B), (α,β- δβ), (a+ δα,β- δβ). Let 
6(a,B) be the angle between the ray and a fixed direction, the x axis say. 
Since the sides PS and QR are of length 4δβ and (A + 4,,α)δβ, respec- 
tively, and the distance between them is Méa, the change in ray inclination 
from P to S 15 


Hence 

ae (8.59) 
Since the inclination of the B curves is 9+42, a similar argument shows 
that 

06 1 ΟΜ . (8.60) 


Equation 8.58 follows by elimination of 9, but it will be more convenient to 
work with the pair of equations (8.59) and (8.60). The system is completed 
by the A-M relation 


(8.61) 


The equivalence of (8.59}{8.60) with (8.50)-(8.51) is easily established 
using : 


ΕΞ cos 6 a, = sin 9 
x M 3 oa M 3 
_ — sing _ cosé (8.62) 
B.=- B= 
x A 3 A > 


Once 0, M, A have been found as functions of a and β, the shock 
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positions may be obtained by integration along rays. On a ray 


Ox 


y 
Maa =cos 8, =sin 8. 


M da 


Therefore 


x= xo( B) + [ McosO da, | 
(8.63) 


a 


y=vo( B)+ J M sin@ da, 


give the position of the shock at time t=a/a, in terms of the position 
X=X(B), y=yo( f) at t=0. 

In general, the coefficient Ay/f(M,) in (8.61) may be a function of B, 
since both A, and M, may vary along the initial shock position a=0. But 
we may define a new variable β and a new A to eliminate this. The 
invariant quantity is the line element A df. If A=k()A, then 


AdB=k( B)AdB=A dB, 
where 


B= [κ(β)αβ. 


Thus any unwanted factor k( B) can be absorbed into a new β. We shall 
assume this has been done, unless otherwise stated, and we take A = A(M). 
In the diffraction problem of Fig. 8.2, the initial shock a=0 is plane and 
M,=constant. We choose β to be the distance from the wall in this 
uniform region. Hence A,=1 and 


I(M) 


ὩΣ 1. μὰ 


8.5 Wave Propagation on the Shock 


It is interesting that (8.59)-(8.61) turn out to be hyperbolic and 
represent a wave motion for disturbances propagating on the shock. A 
little thought shows that this should have been expected. The flow in the 
region behind a deforming shock involves two dimensional waves propa- 
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Fig. 8.5. Cylindrical waves produced in the diffraction of a shock. 


gating with the local sound speed relative to the local flow as sketched in 
Fig. 8.5. Our approximate equations describe in some way the trace of 
these cylindrical waves where they intersect the shock. 

The wave propagation on the shock is brought out by studying the 
characteristic form of the equations. When A=A(M) is substituted in 
(8.59) and (8.60), they become . 


OO ed eee) 
ὃ. 1 aM _ 
da A(M) Of 
The characteristic form is 
0 0 dM \ _ 
[ἃ το) 9: f τ J=0, (8.66) 
where c is the function of M given by 
Ὁ [-πΜ 
c(M)= ΤΩΣ (8.67) 


Since A’(M) <0, the characteristics are real and we have nonlinear waves 
propagating with velocities 


dp 
da 


=7c 


relative to the (a,8) mesh. These waves carry the changes of the shock 
shape and shock strength along the shock. The Riemann invariants are 
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given by (8.66). We have 


dM _ αβ᾽ 

θ-: [ rr = constant ae Fe =(C, (8.68) 
dM _ qp 

θ 1. ΞΞ- ςοηβίδηϊΐ on Ὁ (8.69) 


The wave motion is analogous in every way to the prototype of one 
dimensional nonlinear gas dynamics discussed in Chapter 6, and the ideas 
and techniques established there may be taken over to these waves propa- 
gating along the shock. 

The A(M) relation is derived from (8.25), which may be written 


Ade” ee (8.70) 
Hence 
; 1/2 
Μ΄΄--Ἰ 
Ac=( ------------ , 8.71 
ἔπ πο 


and the integral in the Riemann invariants is 


_ Mam (fxm) "7 
o(M)= [ aa [3] dM. (8.72) 


The explicit formulas for weak shocks, M—1<1, and strong shocks, 
M>1, will be useful. They are 


2 
A (M,-1) 
A~4, θὲ εἰς 
ο (M-1) as M—>1, (8.73) 
Μ--Ἰ δ 172 
ac~( “= , w(M)~2?/2(M—-1) 
and 
h~n=5,0743 for y= 1.4 4 {τ 
Ao M as Moo. (8.74) 


Ac~n7'/2M, | w(M)~n'/? log M 
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The characteristic relations are most easily obtained in the (α,β) 
coordinates, but in applications to specific boundary value problems the 
description in Cartesian coordinates (x,y) is sometimes preferable. It is a 
simple matter to transform (8.68) and (8.69) to this form. We note that 


+ y,dB /da 
dy _ Yat ¥adB/ do (8.75) 
dx x,+xgdB/do 


where x,=Mcos6, y,=Msin@, xg= —Asin@, yz=Acos6. Therefore the 
characteristics dB /da= +c become 


dy Es 
ms =tan(@+ m), (8.76) 
where 
1/2 
Ac : A 
2 — “-  : ---ι--ς.-. 8.77 


and the characteristic equations are 


& =tan(0+ m). (8.78) 


0+w(M)= tant Ci 
w(M ) =constan on Ος τς 


. Values of m(M) and w(M) are given in Table 8.3, which is taken from a 
paper by Bryson and Gross (1961). 

These equations could also be deduced directly from the two dimen- 
sional form of (8.50), (8.51). With a, =cos@/M, 2 =sin@/M, the equiva- 
lent set of equations in θ and M is 


0x\ M oy\ Μ (8.79) 
d {cos@ 0 (51:16) 
αἰ 4 ) ἫΝ A }=0 


It is straightforward, but longer, to show that the characteristic equations 
are the above. 


TABLE 8.3 


Characteristic 
Ray area A angle m The 
Mach no. M Axl0-N xy (degrees) integral w 

] oe) 0 0 
1-:000001 | 3:668749 +10 — 0-003 
1-00001 3:668672 + 8 -- 0-009 
1-0001 3:667902 + 6 0-403 0-028 
1-001 3°660213 + 4 1-280 0-089 
1-01 3:584696 + 2 4-002 0-283 
1-05 1-:310728 + 1 8-544 0-633 
1-10 2:946288 + 0 11-474 0-896 
1-15 1-:184152 + 0 13-142 1-097 
1-20 6:053638 — 1 14-843 1-266 
1-25 3°536658 — 1 15-958 1-414 
1-30 2:250720 — 1 16-859 1-547 
1-35 1-520662 — 1 17-604 1-669 
1-40 1:074028 — 1 18-231 1-728 
1-45 7:850741 — 2 18-766 1-887 
1-50 5:898186 — 2 19-228 1-984 
1-55 4-531934 — 2 19-630 2-077 
1-60 3:548150 -- 2 19-983 2-165 
1-65 2-822580 — 2 20-295 2-249 
1-70 2-276434 — 2 20-572 2-330 
1-75 1-858064 — 2 20-820 2-406 
1-80 1-532637 -- 2 21-042 | 2-480 
1-85 1:-276079 — 2 21-242 2-551 
1-90 1:071389 - 2 21-423 2-619 
1-95 9-063299 — ὃ 21-587 2-685 
2-00 7:719471 — 3 21-736 2-749 
2-05 6-615861 -- 3 21-872 2.811 
2.10 5:702352 -- 3 21-997 2:87] 
2.18 4.9040725 - - 3 22-111 2-929 
2:20 5 4-301517 -- 3 22-216 2:98 
2:2ὅ 3:761766 -- 3 22.312 3°040 
2-30 3:303423 -- 3 22-401 3-094 
2-40 2-576553 — ὃ 22-560 3-203 
2-50 2:037086 - 3 22-696 3-302 
2-60 1:6380023 -- 3 22-814 3:388 
2-70 1-318343 -- 3 22-916 9.477 
2-80 1:076566 -- 3 23-006 3-563 
2-90 8:868121 — 4 23-085 8:64 
3:00 7-363072 -- 4 23-154 3:°724 
3°20 5°184216 — 4 23-271 3°875 
3-40 3:740925 — 4 23-364 4-015 
3:60 2:757067 — 4 23-439 4-148 
3°80 2:069662 — 4 23-501 4.272 
4-00 1-578970 -- 4 23-552 4-398 
4-50 8-519558 — 5 23-647 4-660 
5:00 4:926060 — 5 23-710 4-900 
6-00 1:921342 — 5 23-788 5-314 
7-00 8-705958 — 6 23-832 5-672 
8:00 4-395269 -- 6 23:859 5-966 
9-00 2-408270 — 6 23-876 6-232 
10-00 1-407051 — 6 23-889 6-470 
15-00 1:786391 -- 7 23-917 7-385 
20-00 4:141420 — 8 23-926 8-033 
100-00 1-172427 —-11 23-937 11-67 
ee) 0 23°938 co 
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8.6 Shock-Shocks 


The function c(M) is an increasing function of M. Therefore waves 
moving in the positive direction and carrying an increase of M and @ will 
break in typical nonlinear fashion. From the earlier experience in 
analogous problems, we assume that a discontinuous jump in M and @ will 
be required; that is, the shock develops a corner as shown in Fig. 8.6. 
Within this approximate theory, we follow the usual philosophy of such 
“shock” discontinuities and derive jump conditions from the conservation 
form of the basic equations. These “shocks” in the waves on the original 
gas dynamic shock will be referred to as “‘shock-shocks.” The waves on the 
shock are interpreted as the trace of roughly cylindrical waves which are 
spreading out in the flow behind the shock. A shock-shock is the trace of a 
true gas dynamic shock in the flow behind the main shock. Thus it 
corresponds to the three-shock Mach reflection (described at the end of 
Chapter 6) which has been studied and investigated directly. We shall 
discuss the relation with Mach reflection in more detail later. 

The differential equations (8.59) and (8.60) of this theory are derived 
from the metric (8.57) of the shock-ray network. The corresponding finite 
form is required to deduce jump conditions. Consider the neighborhood of 
the discontinuity in two successive positions of the shock as shown in Fig. 
8.6. Let the difference in the a coordinates for the two shock positions be 
Aa, and let the difference in the B coordinates of the rays be Af. Let 
subscripts 1 and 2 refer to values ahead of and behind the discontinuity. 
Then, in Fig. 8.6, PO= M,Aa, QR= A,AB, SR= M,Aa, PS= A,AB. 


Fig. 8.6. Line elements for a shock-shock. 
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Expressing the distance PR in two alternative ways, we have 


(M Aa)’ +(4,AB)’=(M,Aa)’ + (44,Δβ}}. 


But the ratio AB/Aa is the shock-shock velocity C in the (a,f) 
coordinates; hence 
ΜΣ-- My 


Ce 8.80 
Aj— Aj ey 


The corresponding jump in θ is deduced from 


cot(6,—0,)=tan(RPQ+ RPS) 


_ A,C/M,+M,/A,C 
ἢ 1—A,M,/M,A, 


Substituting for C from (8.80), we have 
1/2 1/2 
(My ~ My) (Α4ϊ-- Αἢ 


A,M,+A,M, CL 


For the description in Cartesian coordinates (x,y), (8.80) is transformed to 
the equivalent forms 


1/2 


A, { Μ2-- ΜΞ 
{33} : i=1 οἵ 2, (8.82) 


tan (χ -- θ,) Μ|142- Α3 
where x is the angle of the shock-shock line with the x axis. 

[{15 assumed that the functional relation (8.61) between A and M still 
applies even for the sharp change in channel section at a shock-shock, the 
velocity C is determined by (8.80) in terms of M, and M,, and the jump in 
θ is determined by (8.81). Then for weak shock-shocks, M,—M,->0, it is 
easily checked that (8.80) reduces correctly to the characteristic velocity 
(8.67), and (8.81) reduces correctly to the Riemann invariant relation. 
However, for sufficiently strong shock-shocks the dependence of A, on M, 
will not be given accurately by (8.61) since the relation was derived on the 
supposition that the channel section varies slowly. Nor is it just a question 
of establishing the correct formulas relating M and A at an abrupt change 
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in channel section. In fact, these formulas have been found by Laporte 
(1954). In reality, as is known from the traditional discussion of Mach 
reflection, there is a third shock and a vortex sheet behind the main shock; 
in principle, then, one should take the additional relations from the 
analysis of the three-shock configurations. This would be a complication 
which does not seem to be worth pursuing in detail in view of the 
approximate nature of the theory. We might note, however, that if it were 
done, the A-M relation for the waves following the shock-shock would 
take the form 


A=k(B)f(M), 


where k( 8)=A,/f(M,) has to be found from the three-shock relations; it 
would be incorrect to continue the A-M relation through the shock-shock 
back to the initial position and take k= A,/f(Mo). The whole thing is like 
the question of entropy in ordinary gas dynamics, where first of all one 
assumes that p=p(p) and this leads to simple waves. But then, since 
compression waves break, shocks have to be considered and they involve 
entropy changes so that p is no longer a function of p alone; behind the 
shock the entropy is constant on each particle path. We have the 
analogous situation with A and M similar to p and p, and k playing a role 
similar to the entropy. The simpler theory of shock-shocks with A = A(M) 
in (8.80) and (8.81) is rather like neglecting entropy changes at gas 
dynamic shocks. This is known to give accurate results if the discontinuity 
is not too strong, and it is expected that the same will be true here. A 
comparison between these simpler shock-shock conditions and the three 
shock results for Mach reflection will be given in Fig. 8.11. It substantiates 
the view that the more elaborate treatment would not be worthwhile in the 
context of this approximate theory. 


8.7 Diffraction of Plane Shocks 


We now consider specific applications of the general theory and start 
with the problem of the diffraction of a plane shock as it moves along a 
curved wall. The geometry for a convex curve is shown in Fig. 8.2. The 
wall is a ray and the shape of the wall provides a given boundary value 
6= 6) on the wall. If we use the (a, 8) coordinates, the wall can be taken to 
be the ray 8 =O. In the first instance, θ᾽, is known as a function of distance 
s along the wall. However, if we pose 9=9@,(a) on the wall, we can 
determine the relation between a and s from the final solution. For simple 
shapes, such as sharp corners, this relation is not needed since 6,, just takes 
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constant values on the two sides of the corner, and the (a, β) description is 
simpler. For more general shapes, for which this implicit relation would be 
a nuisance, it is usually better to work in the (x,y) description and use the 
equivalent set of equations (8.78). 

The wall is initially straight with 6, =O and the shock is uniform with 
9=0, M=M,. We choose β as the distance from the wall in the uniform 
region so that Ay=1 and (8.64) applies. We choose a=0 as the initial 
position at the start of the corner. The complete problem then is to solve 
the equations (8.65) for initial and boundary values given by 


6=0, M=M, fora=0, 0<B< 0, 
θ-- θ.(α) for B=0, O0<a<oo. 


The propagation of the waves along the shock is analogous to one 
dimensional waves in gas dynamics. The displacement of the wall corre- 
sponds to a piston motion, and we may think of the relative displacement 
of the wall pulling or pushing the foot of the shock and sending out waves 
along it. A convex corner corresponds to the piston being withdrawn and 
sending out expansion waves, whereas a concave corner corresponds to 
pushing the piston in to produce compression waves. In either case, until 
shock-shocks are produced by breaking, the solution is diagnosed as a 
simple wave by exactly the same type of argument used in Section 6.8. The 
C_ invariant in (8.69) is constant everywhere since all the C_ originate in 


the uniform region ahead of the waves in which θ ΞΟ, M= M,. Therefore 


§—w(M)=w(M,) (8.83) 


throughout, where w(M) is given by (8.72). In particular, the Mach number 
M,, at the wall, which is perhaps the most important result, is given in 
terms of 6, without further calculation of the solution. We have 


6, =w(M,,) —w(M,). (8.84) 


For any particular @,, the corresponding value of M, is obtained by 
solving this relation using Table 8.3. 

In the simple wave, the values at the wall propagate out and remain 
constant on the C, characteristics shown in Fig. 8.7. If a characteristic 
variable τ is specified as the value of a at the point of intersection of the 
characteristic with the wall β =0, the simple wave solution is 


6=6,(7), M= M,,(8@,,), B=(a-rT)c(M, ). (8.85) 
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Fig. 8.7. The characteristics in the diffraction of a shock. 


The corresponding equations in the (x,y) description are 


§=6,,=tan~ 'y/,(§), M=M,,(6,), 
(8.86) 


y=y,(§) + (x—&)tan(, + m,). 


Expansion Around a Sharp Corner. 


For a sharp convex corner, θ jumps from 0 to a negative value θ᾽, and 
remains at that value. The corresponding Mach number at the wall jumps 
from M, to the value M,, given by (8.84). The disturbance is a centered 
simple wave (see Fig. 8.8), and the solution for M in the wave is found 
from 


Fa c(M), ¢(M,,) <= <e(My). (8.87) 
The corresponding θ is given by (8.83). Along the shock 
0 
7387 — sin 8, Tap 70088 


therefore at time t=a‘/a, the shock is given in terms of the parameter B by 


B 
M 
v= aM, cost, [ ui ) sind dB, 
 f( Mo) 


ΚΜ 
(8.88) 
Ρ f(M) 
|, 700) 


γθΞαΜ,,5θ,- cos@ dB. 
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Since M and @ are functions of the single variable β..α, it follows that x/a 
and y/a are also functions of this single variable. Hence the shock pattern 
expands uniformly in time. This could have been deduced directly in 
advance by dimensional arguments. There is no fundamental length or 
time in the problem so that all flow quantities must be functions of x/dof 
and y/dot. This is equally true in the exact formulation and in the 
approximate geometrical theory. 

The first disturbance propagates out on the characteristic B= ac(M,). 
Since αὶ measures distance from the wall on the initial undisturbed shock 
and αΞΞ αρί, the velocity in physical space is agc(M)). One of the few 
quantities that can be determined in the exact formulation of this problem 
is the speed of the first signal. According to the theory of sound, the first 
possible disturbance from the corner travels out into the flow behind the 
shock with the local sound speed a relative to the local flow velocity uw. 
Therefore the disturbance travels along the shock with speed 

1/2 


{a?—-(U-u)}, (8.89) 


where U is the shock velocity. The quantities U,a,u all can be expressed in 
terms of Mo, and it is found that (8.89) is agc* where 


 {(μ4- ἡ[- ἡ Μ2:21}. 
ἐπὶ (y+1)M2 | | 


This is to be compared with 


Ν {Meal 
Co™ ca)=| MM) ; 


where A(M) is given by (8.26). For weak shocks, 


1/2 
com | 5 (Mo~1) | tL {2(My-1)}'7, My 1s (8.90) 
for strong shocks, taking y = 1.4, 


Co~0.4439M,,  c*~0.4082M,, Μρ.»οο. (8.91) 


Thus the dependence on M, is the same, and in fact there is reasonable 
numerical agreement for M,>2. For weak shocks, cg=4c*. One could 
claim that c* gives only the speed of the first signal and the main 
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disturbance could in fact come later. But on the whole the evidence seems 
to be that for weak shocks the true disturbance is distributed over the 
whole sonic circle and the approximate theory concentrates the distur- 
bance roughly halfway out. We shall see below that the prediction of the 
total magnitude of the disturbance, as evidenced by the value of M,, is 
very good and the concentration of the disturbance in this theory is 
unavoidable. For stronger shocks the disturbance is more concentrated 
and the approximate theory represents the behavior very well. One might 
add that the theory emphasizes local behavior near the shock, and this is 
obviously better for stronger shocks. Fortunately, the problems of weak 
shocks are less interesting and can be handled by linear acoustics in any 
case. 

Even though the exact formulation for diffraction around a sharp 
corner can be reduced to a similarity solution in x/agt,y / Apt, very little 
can be done with it in general. However, for small angles 6, the flow 
behind the shock can be linearized and the solution carried through. This 
was done by Lighthill (1949). We may compare our results with Lighthill’s 
results in this special case. For small θ.,, (8.84) for the Mach number M,, at 
the wall may be approximated by 


1/2 


M3 -1 | 
=| Seay | θ,, οτος (8.92) 


We compare this with Lighthill’s results in the two extreme cases M,—>!1 
and M)-— oo. For weak shocks | 


w 


; 1/2 
M,~Mo~| 5(Mo~1)| 6, 


whereas Lighthill has 8/37 times this. For strong shocks 


Lighthill’s value has to be taken from a graph but the numerical factor 
appears to be about 0.5. Lighthill’s theory shows that the disturbance is 
spread out over the entire sonic circle for weak shocks, but for stronger 
shocks it concentrates more and, in fact, the curvature tends to infinity as 
M,-> 00. 

In view of the relative simplicity of this approximate theory, the 
results are remarkably good. Lighthill’s analysis, limited to a sharp corner 
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and to a small angle, is already quite heavy by comparison, and the 
approximate theory may be applied to the enormous variety of problems 
for which other analytic solutions have not been found. The results should 
be good except for very weak shocks and even there the prediction of total 
Mach number change should be good. Experimental tests showing this 
agreement will be noted later. 

The solution for any initial Mach number and any corner angle is 
given in (8.87) and (8.88). The formulas simplify in the limit of strong 
shocks, M,—>0o, and the form of the solution becomes clearer. For strong 
shocks, the appropriate expression for M, becomes 


M,=M ΄» (8.93) 
= exp| — |], ᾿ 
w 0 Pp Vai 
and in the fan 
— ,1/(ne1) 
M _{ BVn 
μι Moa i 
(8.94) 
Vn BVn 
ὦ n+} log Mya 


The equation of the shock at time t=a/dy is found from (8.88) with 
J(M)=M~". It is easiest to use @ as a parameter instead of 8, and to fix 
the constants of integration from x= M aot, y= Myaot/Vn , when 0=0. 
Then we have, 


ie 
~ -[Ξ:}} 65) Μῆ sin(n -- θ) 


t 
ἐὺ - ὩΣ 6,.<0<0, (8.95) 
ΟΞ τ a0) = 
Μ αι [ " ) e cos(n — A), 


where tann=Vn. The shape of the shock is plotted in Fig. 8.8 for 
6,= -- π͵72. The similarity form of the solution has already been noted, 
and we add that for strong shocks the solution scales with M, also. 

The solution for various M, has been compared with experimental 
results by Skews (1967). The agreement is found to be reasonably good 
and typical results are reproduced in Fig. 8.9. 

For strong shocks there is no limit on the magnitude of @, for which a 
solution can be found. For sufficiently weak shocks there is a limit, since 
M,, cannot decrease below unity. Hence if @, decreases below the value 


Fig. 8.9. Shock diffraction: comparison of experimental results (solid lines) with theory 
(dashed lines). (Skews) 
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θα given by 


9 Mo dM 
lim —_ J Ac ᾽ 


there is no solution. Presumably this corresponds to strong separation or 
other effects at the corner, but at present the interpretation is unclear. 


Diffraction by a Wedge. 


For a concave corner, the waves on the shock break and a shock- 
shock must be introduced into the solution (8.85), using the jump condi- 
tions established in Section 8.6. We consider in detail only the solution for 
a sharp concave corner, which is equivalent to the problem of diffraction 
of a plane shock by a wedge. This problem, has received considerable 
attention in the literature (see Courant and Friedrichs, 1948 p. 338). In the 
approximate theory the solution is simple. The solution is a shock-shock 
separating two regions in which M and @ are constant, as in Fig. 8.10. 
From (8.81), the Mach number at the wall is obtained by solving 


1/2 172 
᾿ς (δ. μ᾿ AEA)? Ay _ Κρ 
ΑΜ ἜΛΟΜς ᾿ Ao (Mp) 


(8.96) 


The angle x for the line of the shock-shock shown in Fig. 8.10 is expressed 
from (8.82) as 


tan(x—9,) = (8.97) 


Ao 


A | 1-(Mo/M,)° ᾿ 
1- (4,,.4.)}} 


--Π 9, 


Fig. 8.10. Mach reflection at a wedge. 
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For strong shocks 


n 


A, [{M, 
Ay \M,}? 


w 


and x becomes a function of θ᾽, alone. It is plotted in Fig. 8.11. 

In reality, the true configuration is the Mach reflection with a third 
reflected shock and a vortex sheet, as indicated in Fig. 8.10. Moreover, the 
“Mach stem,” the part of the shock near the wall, is slightly curved. The 
gas dynamic shock relations for the three shocks provide relations between 
the angles of the flow and the shocks at the triple point. If we assume that 
the Mach stem is straight, these allow an alternative determination of x as 
a function of 0,. This is the broken line in Fig. 8.11. The difference for 
small 0,, is about as expected, being of the same order as the discrepancy in 
(8.91). Then, fortuitously, the curves come closer together and cross. In the 
three-shock theory there is an upper limit on 6, at which Mach reflection 
goes over into regular reflection (see Section 6.17), while the simplified 
shock-shock relations continue to predict a very tiny Mach stem. However, 
for @ greater than about 70° the Mach stem is so small that we have 
virtually the same picture as regular reflection. Again we conclude that the 
theory is remarkably good. 


Diffraction by a Circular Cylinder. 


Perhaps the most severe test of this theory is the application to 
diffraction by a circular cylinder which was carried out by Bryson and 
Gross (1961) and then compared with their experimental results. There is a 


5° 
8 
O° 10° 20° 30° 40° 50° 609 70° 80° 90° 


Fig. 8.11. The Mach reflection angle x-@ versus wedge angle @. The solid line refers to the 
present theory; the broken line refers to the three-shock theory. 
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y = Shock -shock 


Fig. 8.12. Diffraction of a shock by a sphere or a cylinder. 


difficulty in starting the solution at the nose, but Bryson and Gross 
proposed a satisfactory way to handle this. First of all, the shock suffers 
regular reflection until an angle of about 45° around from the nose is 
reached, when a Mach stem is formed and subsequently grows. As noted 
in Fig. 8.11, the approximate theory predicts a Mach stem for all 8, up to 
a/2. Bryson and Gross adopted the view that this is virtually regular 
reflection if the Mach stem is extremely small. There is still a difficulty in 
getting the calculation started at the nose, however, since the behavior 
there is singular. They adopted the following procedure. It is assumed in 
the early stages that the small Mach stem is straight and radial, as in Fig. 
8.12. If its length is ὃ at an angle φ around from the nose, and the radius of 
the cylinder is normalized to unity, the undisturbed rays contained in a 
stream tube of area A,=(1+ b)sing pass through the area A, = δ. Hence 


b _ f(M)) 
(1+b)sing f(M,)— 
Since a is continuous at the shock-shock and is given by x/M, in the 


undisturbed part of the shock, we have a={1—(1+))cosey}/M,. The 
Mach number M is given by 


(8.98) 
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Shock-shock 


0-12 


0:04 


0 10° 20° 30° 40° 50° 
ῴ 
Fig. 8.13. The shock-shock standoff distance. 


at radius R. Putting these two together and taking the mean position 
R=1+ 45 for M,, we have 


ia ἘΝῚ} 
1+b/2 dp 


My 
— {1—(1+ b)cosq}. (8.99) 
M, 

Equations 8.98 and 8.99 provide a differential equation for b(q) to be 


solved subject to the initial condition b=0,@=0. For strong shocks, 
M,>1, it is 


= 175 
db 1+b/2 b 
—=(1+))t - ------------------ 8.100 
dy ( Jang cosm | (1+5)sing ( ) 
For small φ, 
b=sin”*'g, g<l. 


The solution of (8.100) is plotted in Fig. 8.13. Bryson and Gross use this 
solution up to y=45° and then continue with the detailed characteristics 
solution. When the two Mach stems meet behind the cylinder a second 
shock-shock is formed. The results are shown in Fig. 8.14 and are com- 
pared with their experimental observations. The theoretical shock positions 
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y/D 


455 


x/D 


Fig. 8.14. Diffraction by a cylinder. M,)=2.81: circle=Re7.79 x 10*; triangle = Re0.87 x 107, 
cross = vortex locus. (Bryson and Gross, 1961.) 


and the two shock-shocks are the full lines, the rays are the broken lines. 
The circles and triangles are experimental points for the shock-shock 
positions at Reynolds numbers Re=7.79 x 10*, Re=0.87 x 10‘, respec- 
tively. In the experiments a vortex is formed near the front and its locus is 
shown by the crosses; it is, of course, not included in the simple theory. 
Schlieren photographs of the flow pattern are presented in Figs. 8.15a,b,c. 


Diffraction by a Cone or a Sphere. 


For three dimensional problems, the formulation in (8.50)-(8.52) is 
used. For axially symmetric problems, the first two equations are 


a. "ae 
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Fig. 8.15. (a) Schlieren photograph of shock diffraction on a cylinder of } in. diameter. 
M,=2.82. Note the boundary-layer separation starting. Notation: I.S., indicent shock; M.S., 
mach shock; R.S., reflected shock; C.D., contact discontinuity; T.P., triple point; V., vortex. 


(Bryson and Gross 1961.) 
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Fig. 8.15 (Ὁ) Schlieren photograph of shock diffraction on a cylinder of | in. diameter. 
M,y=2.81. Notation: 1.S., incident shock; M.S., mach shock; R.S., reflected shock; C.D., 
contact discontinuity; T.P., triple point; V., vortex. (Bryson and Gross 1961.) 
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Fig. 8.15 (c) Schlieren photograph of shock diffraction on a cylinder of } in. diameter. 
M,=2.84. Notation: M.S. mach shock; R.S. reflected shock; C.D., contact discontinuity; 
T.P., triple point; V., vortex. (Bryson and Gross 1961.) 
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where x is distance along the axis and r is the radial distance. It is again 
convenient to introduce the ray angle @ by 


cos@ sin @ 
Ox Μ᾿’ ig Μ᾿ 
and work with the set 


d {rcos@ 0 (rsin@ 
A _ SM) 
Ag f(M)) 


The boundary condition on a solid wall r=r,,(x) is tan? =r/(x). 

For diffraction by a cone, the solution is a similarity solution in which 
all quantities are functions of r/x. Equations 8.101 can be reduced to 
ordinary differential equations which have to be solved subject to the 


0 10° 20° 322. 409 500 60" 70° 80° 


Fig. 8.16. Comparison of theoretical and experimental results for the shock-shock angle in 
diffraction by a cone. (Bryson and Gross, 1961.) 
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Shock-shock 1 


Characteristics 


_- Τηρὶς point 2 


x/D 


Fig. 8.17. Diffraction by a sphere. Circles represent My=2.85; crosses represent M)=4.4l. 
(Bryson and Gross, 1961.) 


conditions on the wall and at the shock-shock. The details are given in the 
original paper (Whitham, 1959b). Bryson and Gross extended the calcula- 
tions and compared the results with experiments. Figure 8.16 compares 
shock-shock angle x with wall angle 6,, for My=3.68. 

For a sphere, Bryson and Gross performed a characteristics calcula- 
tion for (8.101), starting the calculation from an approximate treatment of 
the nose region analogous to their method for the cylinder. Their results 
are not as detailed as for the cylinder, but the agreement between theory 
and experiment shown in Fig. 8.17 is equally good. 


8.8 Stability of Shocks 


The theory puts into quantitative terms one of the arguments that has 
always been used to explain the stability of plane shocks. Suppose that for 
some reason a portion of the shock has developed a bulge as shown in Fig. 
8.18. The delayed part is now concave forward and so will strengthen as it 
propagates. As it strengthens, it speeds up and thus tends to reduce the 
bulge. Similarly, any section of the shock ahead of the rest weakens and 
slows down,. The overall effect is one of stability. The arguments for 
changes of strength depending on the curvature are put into quantitative 
terms in the A-M relation. 
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Fig. 8.18. Sketch of a shock positions (solid lines) and rays (dashed lines) for nonlinear 
resolution of a caustic. 


In linear geometrical optics, a concave portion of a wavefront would 
produce a caustic, since the linear rays would be normal to the initial wave 
front and form an envelope (refer back to p. 247). As the wavefront 
propagates down the converging ray tubes it strengthens, and its strength 
tends to infinity as it reaches the caustic. But in the linear theory the speed 
is unchanged, and hence the rays remain the same. In the nonlinear theory 
developed here, the shock speeds up as it strengthens. This pushes the rays 
apart and there is no overlap and no caustic. The shock overshoots as 
shown in Fig. 8.18 and the disturbance evens out as it spreads along the 
shock. 

In detail the problem would be formulated as an initial value problem 
with M and @ prescribed on the initial shock position. In two dimensions 
the problem would be exactly analogous to the problem discussed in 
Section 6.12. There would be an initial interaction region and then the 
disturbance would separate into two simple waves moving in the positive 
and negative directions along the shock. In each the total change in 9 and 
M would be zero, so that they would ultimately take the N wave form with 
shock-shocks at the front and at the back and a linear decrease of 6 
between. The shape would be that of Fig. 8.18. Detailed calculations will 
not be given here. According to the general results established earlier, the 
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shock-shocks decay like ¢~'/?. For a uniformly distributed disturbance 
such as an initial sinusoidal shape, the disturbance eventually decays like 
1/t (see Section 2.8). 


Stability of Converging Cylindrical Shocks. 


An interesting and important question arises concerning the stability 
of converging cylindrical and spherical shocks. The expected intense 
pressure at the center would be considerably reduced by imperfect focus- 
ing. Experiments by Perry and Kantrowitz (1951) showed very symmetrical 
shapes for weak and moderate shocks, and some indication of instability 
for strong shocks, although the conclusions do not seem to be clear-cut. It 
is interesting to analyze the question using this theory. Local corrugations 
on the shock will have the tendencies described for plane shocks, but these 
have to be superimposed on an overall convergence and strengthening of 
the whole shock. Delayed parts will have the tendency to strengthen but 
the other parts are already strengthening due to the general convergence 
and are nearer the center. So the delayed parts could continue to lag 
behind and could possibly be left further and further behind. While the 
radius is sufficiently large it seems clear that the behavior would be close 
to that of plane shocks and the propagation would be stable. The question 
then concerns the behavior as the strength becomes large close to the 
center. 

The problem for strong cylindrical shocks was analyzed by Butler 
(1955) using a small perturbation treatment which implicitly included the 
approximations of the ray tube theory. With the general formulation 
developed here, it can be handled more easily and without making small 
perturbation assumptions. For strong shocks, the two dimensional equa- 
tions (8.59)-(8.61) may be written 


26, "Μ᾽ aM _ 
0β M"t2 θα 


0, (8.102) 


ὃ M" 9M 9 


aa + Mi ap” (8.103) 
The symmetrical solution for a shock with initial radius Κρ is 
~1/(n+1) 
_ B _ n+) Moa 
= ΓΝ M=M, ~n Ro 5 α «0. (8.104) 
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This is, of course, Guderley’s solution. 

To study perturbations to this solution, we use a hodograph trans- 
formation of (8.102)-(8.103), and interchange the roles of dependent and 
independent variables. This produces linear equations without any 
approximation as to the size of the perturbations. First we introduce new 
variables 


cau (n+1)0 Mya 
q={ = 3 Θ- ------ a ~ 
Mo Vn Vn 
and (8.102)-(8.103) become 
oq 3,00 _ 
ὃς "4 OR "Ὁ 
(8.105) 
0 
a0, 4 _, 
ds dp 


In these variables the symmetrical solution is gx1/s, Θα β. In the 
hodograph transformation, f,s e meted as functions of g,@. The trans- 
formation formulas are 0, =Js,, —JB,, Ig = 59. 9,=J Bo, where J 
is the Jacobian 0(q, ®)/0(s, B). αἰ ἜΜ. in (8.105) become 


Bet q’s,=0, B, + 59=0. 
When β is eliminated we have the single equation 


qSqq + 245, = 599. (8.106) 


Solutions to this (by separation of variables) are 
s=qteim® μ-- 2: 1- ἢ : (8.107) 


If m=0, με --Ἰ gives the symmetrical solution. If m>1, Ryu=—4 
Therefore when g—0o as the shock contracts to the center, the Banianies 
dominate the symmetrical mode. Hence the shock is unstable. 

The imaginary part of » shows that the disturbance consists of waves 
traveling around the shock. When the disturbance becomes large it is 
possible for J to vanish. This means that the mapping from the (q,@) to 
the (s,8) plane is no longer single valued, and it corresponds to the 
appearance of shock-shocks. When this stage is reached, further calcula- 
tions would have to be carried out numerically in the (s, 8) plane. 
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8.9 Shock Propagation in a Moving Medium 


For propagation into a moving stream, the linear theory shows con- 
clusively that rays are not orthogonal to the wavefronts (see Section 7.9 
and Fig. 7.12). Correspondingly, we cannot expect the rays to be orthog- 
onal to the shocks in the nonlinear theory. At first sight this poses a 
problem since the nonlinear formulation relied on the orthogonality to 
compare propagation in a ray tube with propagation down a given chan- 
nel. However, a way out is to consider propagation in a uniform stream as 
a test case. In a frame of reference moving with the stream the original 
formulation applies. It remains only to make a Galilean transformation to 
another moving frame to see the correct formulation for moving media. 
Then, as expected, the rays are not orthogonal to the shocks. The re- 
sults have been given (Whitham, 1968) and applied by Huppert and 
Miles (1968). It would be interesting to pursue this investigation to see how 
the theory for a moving medium might have been formulated directly. It 
indicates that the closeness of rays to streamlines is not as important as 
was originally thought, and this might lead to more novel points of view. 


CHAPTER 9 


The Propagation of Weak Shocks 


As indicated at the beginning of Chapter 8, we can pursue a different 
approach and cover a different class of problems when the waves are 
moderately weak. The geometrical effects are accepted unchanged from 
linear theory and we are then able to cope with more general nonlinear 
interactions within the wave profile. The approximation procedures will be 
developed for unsteady waves, using spherical and cylindrical waves as 
prime examples, and then applied more specifically to the sonic boom 
problem, which is perhaps the most interesting situation where weak 
shocks have to be studied. The unpleasant aspects of sonic booms loom 
large, but they are in fact extremely weak shocks and, naturally, the aim is 
to make them weaker. The maximum overpressure at the ground for 
current and contemplated supersonic transports is about 2 Ib/ft*; this 
corresponds to a shock strength of about 10~°. The basic problem for 
uniform velocity and flight path may be treated as one in steady super- 
sonic flow, so this work also continues the development in Section 6.17. 


9.1 The Nonlinearization Technique 


Geometrical effects arise in their simplest form for spherical waves. If 
the linearized theory is governed by the wave equation, the solution for an 
outgoing wave may be written* 


_ f(t—r/c9) 
Veo = 


r 


(9.1) 


where Cy is the propagation speed. The amplitude decays like 1/r as the 
energy spreads out across a surface area increasing like r*. The wave 


*It was convenient in Section 7.3 to use different symbols R and r for the radial distances in 
spherical and cylindrical geometry, since the solution for a point source was used to generate 
the solution for a line source. It is no longer necessary and we use r in each case. 
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profile is reduced by the factor 1/r but is otherwise undistorted. If this is 
the linearized solution to a nonlinear problem, we know that the nonlinear 
distortion of the profile will be crucial for a correct treatment of wave 
breaking and shock propagation. Suppose that the correct nonlinear pro- 
pagation speed determined from the characteristic equations is in fact c(q), 
the linearized speed cy being its value at φῦ. Then the nonlinear 
distortion may be introduced by modifying (9.1) to 


ΩΝ (9.2) 


r 


where 7(t,r) is to be determined so that the curves t=constant satisfy the 
exact characteristic condition. That is, we require 
dr 


7a c(¢) on 7 =constant. (9.3) 


Since Φ is expressed as a function of + and r, this provides a differential 
equation to determine τ, and the inverted form can be integrated im- 
mediately. We have 


a ee 
dr οὐ f(t)/r} 


on 7T=constant; (9.4) 


hence 


dr + T(r), (9.5) 


J They) r} 


where the integration is carried out holding + constant and TJ(r) is an 
arbitrary function allowed by the integration with respect to r. Equation 
9.5 determines 7(¢,r) implicitly and the combined equations 9.2 and 9.5 
provide us with a “nonlinearized” solution. This nonlinearization techni- 
que was proposed first by Landau (1945) and independently by the author 
in the context of the sonic boom problem (Whitham, 1950, 1952). 

The function T(r) corresponds to the arbitrariness in the choice of the 
characteristic variable. Once 7(7) has been chosen, f(r) is determined from 
an appropriate boundary condition at the source. Different choices of T(r) 
are compensated for in the resulting f(r). For the general development one 
may take 7 (τ) ΞΞξ τ for simplicity but the extra flexibility is sometimes useful 
in specific problems. It should be noted that the functions f in the linear 
result (9.1) and in the nonlinearized version (9.2) will be the same only if 
T(r) is chosen so that r=t—r/cy (to a sufficient approximation) where the 
boundary condition is applied. 


314 THE PROPAGATION OF WEAK SHOCKS Chap. 9 


Of course this nonlinearized solution usually does not satisfy the 
relevant nonlinear equations exactly, nor has it been deduced at this point 
as a formal approximation. However, it appears to include the important 
nonlinear effects for small φ. In the simpler case discussed in Section 2.10 
and in the case of plane waves, we saw that the linearization of the 
characteristics was the source of the nonuniform validity. Nonlinearity will 
also modify the amplitude factor 1/r, but one would expect this second 
modification to be uniformly small in φ. This view will be substantiated 
later when we look more carefully at the justification of the procedure. 
First, the further consequences and extensions of the procedure are 
mapped out to see its full scope. 

Since » takes a particularly simple form for spherical waves, the 
integral in (9.5) is tractable and can be simplified by a change of variable 
to f(r)/r. In other problems, however, the corresponding expressions are 
less amenable and it is valuable to streamline the analysis. In using the 
linearized result as a starting point, we have already assumed that φ is 
small so it is consistent to expand c(q@) as co+c,p + Ο(φ2), say, and to use 
(9.3) in the approximate form 


ς 
ca ΤΕ -“φ. (9.6) 


The expansion is taken for dt/dr rather than dr/dt in view of the 
subsequent integration with respect to r. Then corresponding to (9.4) we 
have 


eo a - a ao on t=constant, (9.7) 
and the characteristics are 
r Cy 
t=—~— —f(r)logr+ T(r). (9.8) 


The linear theory would take r=t—r/co or a function of it, and we see 
that it is not uniformly valid because the additional term tends to infinity as 
r—oo. The term in logr is small compared with r, but it should be 
compared rather with cyt—r, which measures distance from the head of 
the wave. The view that the correction of the propagation speed would be 
crucial is confirmed, and there is a close analogy with the situation 
discussed in Section 2.10. 
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The approximation of (9.4) by (9.6) is not only a simplification. In 
most problems it would actually be inconsistent to keep higher order terms 
in φ, since (9.2) is itself only a first order approximation to 9. 

The singular behavior of logr as r—>0 is only a minor nuisance since 
the correction term is not important near the origin and we could revert to 
linear theory there. However, to use (9.8) as it stands we must exclude the 
origin and apply the solution outside some sphere r=r,(¢) on which 
boundary data are given. (For example, a fluid source could be represented 
by an expanding sphere pushing out the fluid.) We may then choose 7(7) 
so that (9.8) becomes 


r Cy r 
t= — — —f(r\log —— +r. 9.9 
“πὲ alos +4 99) 


With this choice, the nonlinear τ agrees with ¢—r/cy on the boundary 
curve and the function f is the same as in the linear theory. 

Waves described by (9.2) and (9.8) will break whenever the 
characteristics form an envelope and the solution becomes multivalued. If 
c, >0, a wave carrying an increase of pm breaks. Assuming that T’(r)>0, 
this means that breaking occurs when f’(r) >0. On the envelope, 


os eee 
eal (logy T’(r)=0. 


For (9.9), breaking first occurs at a distance given by 


log ——~ ae (9.10) 
ro(Tn) Cif'(T%m) ) 


where 7,, corresponds to the maximum of f’(r). Then a shock must be 
fitted in. The techniques of shock fitting follow closely the earlier treat- 
ment and we defer the discussion for the present. 

We now consider extensions of the procedure. First, the linear solu- 
tion may not be as simple as (9.1). For example, in cylindrical waves the 
solution (7.29) is 


t—r/co 


I q(n) dn 


φΞ -- ------ 
2π 2 
0 V(t—n) - "2 )οῦ 


The characteristic variable t—r/cy is significantly visible in the upper 
limit, but both ¢ and r appear also in the integrand. However, the evidence 


(9.11) 
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from plane and spherical waves is that the nonlinear effects become 
important at large distances. And at large distances, we saw in (7.32) that 


f(t—r/ co) 
ae ae (9.12) 


Therefore the nonlinearization for large distances can follow closely the 
spherical case. If the correct propagation speed is c(~)=c¢y+c,p+ Ο(φῦ, 
we take 


τὴν (9.13) 


26) , 
ΒΝ Seapets /2 
t a Κτ)ὴγ +7. (9.14) 


Here the function 7(7) arising in the integration with respect to r has been 
taken to be τ; the correction term remains small as r-0 and there is no 
need for more elaborate choices of T(r). Again, the linear theory, which 
takes r=t—r/co, is not uniformly valid as roo. Moreover, although the 
emphasis so far has been on the behavior at large distances, the deviation 
of the characteristics from the linear ones depends on 


fr? 


=e (9.15) 


this is large near t—r/cy=0 as well as for large r. So the nonlinear 
correction will be equally important near the wavefront t—r/c)=0 at all 
distances. Significantly, (9.12) is valid for (cgt—r)/r«1, so that it covers 
both situations; the corresponding nonlinearized solution given by (9.13) 
and (9.14) is also valid near the front of the wave for all r. This is of 
tremendous importance since the most interesting problems will have a 
shock at the head of the wave and the nonlinearized solution derived from 
(9.13) can be used to study it in its entirety. 
We may nonlinearize the whole solution by taking 


if q(n)dn 


02S) ξεξ  - - - 5 - Ὸ 
0 γίτ--η)ί(τ--η- 2,760) 


(9.16) 
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and determining the nonlinear τ from this. In this more complete form, the 
characteristic equation corresponding to (9.14) becomes quite complicated. 
But the extra terms remain small compared with f(r)r'/*. It is therefore 
sufficient to take (9.16) combined with (9.14) as a nonlinearized solution 
over the field. In any case, the nonlinearization is of prime importance in 
the region cyt /r<1, where (9.16) may be approximated by (9.13) with 


2 " φ(ηγάη 
f=-a(2) | Vea 


It should be noted, however, that appropriate boundary conditions are 
normally prescribed outside the region cyt/r«1, so that either (9.16) or 
the full linear solution to which it reduces is required to determine the 
function f(r) that appears in (9.13)-(9.14). 


The basic role of geometrical optics now becomes apparent, for (9.12) 
is the geometrical optics approximation to cylindrical waves. In general, 
geometrical optics provides a ray geometry and (for uniform media) we 
have 


p= 0(s)f(1- =] (9.17) 


along each ray, where s is the distance along the ray, ®(s) is the amplitude, 
and f(t—s/co) describes the wave profile. This is the natural form for 
nonlinearization and is applicable just where the nonlinear effects are most 
important—near the head of the wave and at large distances. The non- 
linearization follows by taking 


p=P(s) f(r), 
᾿ : (9.18) 
= Ἢ - aly f O( 5’) ds’ + Τ(τ). 


Combining the results for each ray we have a complete nonlinearized 
solution. It should be noted that c(q) refers here to the velocity on the ray 
and this is not the same as the normal wavefront velocity for anisotropic 
media. For nonuniform media, c and cg may also depend on 5. The s/ Cp is 
replaced by fds/cg and any dependence of c,/cZ on s must be included 
under the integral sign in (9.18). 

At this point, the procedure can be compared with the one in the last 
chapter. Roughly speaking, the f(r) for those problems was a step function, 
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so the nonlinear interaction took a mild form and strong nonlinear effects 
on the ray geometry could be included. Here the ray geometry and its 
effect on the amplitude ®(s) is accepted unchanged from linear theory, but 
more general profiles f(r) can be handled. Presumably a combination of 
both approaches would be needed in still more general problems, but the 
analysis looks forbidding. 

A second extension of the techniques is required because the non- 
linear propagation speed is often a function of the derivatives, φ, and q,, 
rather than of itself. However, the procedure goes through. The expres- 
sions for , and q, are each written in the form corresponding to (9.17) and 
the revised characteristics are determined from the appropriate expansion 


La er ae QP, — αγφ, on 7 =constant. 
ds ας C9 


From (9.17) the corresponding first terms for ,,q, are 


%,=8(s)f'(t), φ,--- 2, MOS) (1), 


and the characteristic relation becomes 


Ot A gs fe ἂν di 
ἜΜ kf'(1) ®(s), Κτεαι -- Co. 


The characteristics are given by 
S , " / 
== —kf'(1) { O(s")ds +T(r). (9.19) 


A specific example of this is provided by spherical waves in gas 
dynamics. The linear theory is the acoustic theory and q,,q, are related to 
the pressure and velocity perturbations. From (7.3)-(7.4) we have 


ρΡ-ρ Ὑ _ YF(t~r/ap) 
Po ae : r 
wid _FUt-r/a)  f(t—r/aoy 
ay a Tr r Agr? ᾿ 


where F(r) = -- (τ) αὖ. The perturbation in the sound speed a will also be 
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needed, and it is given by 


The nonlinearized solution is 


- F - -1F 
ad aie ὦ I ace (7) (9.20) 
Po r Ag 2 r 
F 
Bg ND), (9.21) 


where 7(t,r) is to be determined from the improved characteristics. The 
exact characteristic equations were given in (6.135). The outgoing 
characteristics have velocity a+u. Therefore the first order correction to 
the characteristics requires 


ἄς δ hte (9.22) 


«--... -.--Θ---..:. - ...... ς-κ-  ..ς 
= --, ,ἅἅὕ.-. 


dr ay 2a Fr ager?” 
(9.23) 
r Ύ Ι f(r) 
ah 05 age πὴ 
Ay jag ee ἢ) 


[The relation between F(r) and f(r) is modified to [{{)-Ξ — agF(1)T (τ) if 
T'(r)#1.] Since our interest is in the region αρτ(γ 1, and the term f(r)/r 
is always relatively small in this region, it is sufficient to use 


P—Po _ YF(*) a-ay_y-1 Fr) FU) 
Po ro ao 2 ie r 
(9.24) 
_r Υπ| 
t= an Dag F(1r)logr+ T(r). 


This is a rather trivial example of retaining only the geometrical acoustics 
approximation to (9.21) and (9.23). Cylindrical and other waves in gas 
dynamics are handled similarly and the geometrical acoustics approxima- 
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tion provides a greater simplification similar to the step from (9.11) to 
(9.12). 

When derivatives appear in the expression for c it is more convenient 
to take them as new dependent variables. Then, in all cases, the geometri- 
cal optics approximation leads to expressions for the dependent variables 
which are proportional to 


Φ( 5) (τ). 


where ®(s) is an amplitude function and F(r) describes the wave profile. 
The corrected propagation speed, using this approximation, takes the form 


C=Cyt cok ®(s)F(r), (9.25) 


where the coefficient k is a constant determined by the particular relation 
of c to the dependent variables. The improved characteristics satisfy 
ot 
—=— —-k®(s)F 
ie k®(s)F(7) (9.26) 


and are given by 


t= 5 KF (1) [ O(s’) ds’ + T(r). (9.27) 
Co 0 


Shock Determination. 


Shocks, when required, are fitted in using the weak shock condition 
Ι 
U= 5 (c,+¢)), 
where U is the shock velocity and c,,c, now denote the propagation speeds 


on the two sides. In the present context, it is convenient to describe curves 
in the (5,1) plane by giving ¢ as a function of 5, so the shock condition is 


used in the form 
dt "Ν 1 at dt 
ae 2 (4 ).+(4).} (9.28) 


which is equivalent to first order in the deviation of velocities from cy. If 
the shock is specified by 


5 
t= τ GAY); 
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we have 


G'(s)= Fk Fry) + Flr} (8), 
G(s) = kF(r,) if "(s’)ds’ — T(1,), 


G(s) = kF(15) if ®(s’) ds’ — T(1). 
0 
We then deduce the typical “equal area” relation 


1 (παρ + F(ra)} (Tra) — Tn) } = [FCAT (x). (9.29) 


For a head shock moving into the undisturbed region, the shock 15 
determined by (9.27) with 7 related to s by 


Ι 5 δ , ἜΣ τ ; ᾿ 
5ΚῈ (7) [90 γώ = [Ἐ(τγαΤίτ}. (9.30) 


As s—>0o, the equation of the shock asymptotes to 


: 1/2 
= 2-K| foo} £T(%), (9.31) 
Co 0 
where 


1/2 


Ke [2K f“F(r)ar(~) | ο΄ -F(1) =0. (9.32) 


At the shock the flow quantities are proportional to 


ἘΣ 


KO(s){ [Φγώ , (9.33) 


The typical asymptotic wave form is the N wave, with balanced shock 
waves, and between the shocks the linear decrease in time is proportional 
to 


ΞΞῚ 


ΦΟ) } [Φ γώ} | (9.34) 
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For spherical waves ®(s)=1/s and the shock strength (9.33) decays 
like s~'(logs)~'/2, only slightly faster than the decay for linear pulses. For 
cylindrical waves ®=5~'/? and the shock strength decays like s~3/4. Of 
course plane waves are also covered by these formulas; ® is constant and 
the decay law is s_'/?, in agreement with earlier results. The asymptotic 
decay laws for cylindrical and spherical waves were obtained indepen- 
dently by various writers and the first was probably Landau (1945). 

For more general two and three dimensional waves in a uniform 
medium 

®(s)xA7~'/2(s5), 


where A(s) is the ray tube area. Further details and applications may be 
found in an earlier paper (Whitham, 1956). For nonuniform media s/cg is 
replaced by f{ds/cy and all the dependence on s in (9.26) must be included 
in ®(s). 


9.2 Justification of the Technique 
There are several approaches by which the nonlinearization technique 
can be examined mathematically on specific systems and each one throws 


light on different aspects of the approximations. 
First, suppose the nonlinear equation for is 


Be 
φ, + (cot C19), + = =0. (9.35) 


This is proposed as a model in the first instance, but we shall see a tie-in 
with other cases later. The linearized equation is 


ς 
+ CoP, + ἊΣ =0, (9.36) 
and its solution is 
f= Cc 
Pree τ ὦ */¢0) (9.37) 
xh 


This is analogous to spherical waves for B=1 and cylindrical waves for 
= 4. The characteristic form of (9.35) is 


dy βορ 
(cot 19) ee (9.38) 
at l 


(9.39) 


Sec 9.2 JUSTIFICATION OF THE TECHNIQUE 323 


The exact solution of (9.38) is 


f(r) 
= (9.40) 


pe 1?/60 = 


where τ is the characteristic variable to be determined from (9.39). It 15 
clear that 


_ f(t) 
= x8 


(9.41) 


is a uniformly valid approximation to (9.40) for small φ. This confirms the 
main step in the technique. The determination of τ can be examined using 
the expansions of (9.39) and (9.40) in powers of φ, and the expansions are 
convergent for |g|<cp/c,. We obtain 


2 
(ie Werte Le eres SS a 
dx Co x8 x28 


with coefficients y, related to cg, c,; in particular y, = — c,/ cg. Hence 


2 
A a BE ett 


(Logarithms replace the corresponding powers when B= 1, 5, etc.) The first 
uniformly valid approximation is 


i=T(1)+= + (9.42) 


= x γ᾽ f(r) 1--Α 
t=T(r)+ rs + i-B Χο, (9.43) 
and it agrees with the result obtained from (9.41) and 
dt il % 
aes 2 Q. (9.44) 


Therefore the uniformly valid approximation given by (9.41) and (9.43) is 
exactly the one that would be obtained by the nonlinearization procedure. 
Notice that it would, in fact, be inconsistent to use further terms in (9.44) 
without further terms in (9.41). 

The remaining approaches are illustrated on the equation for spherical 
waves in gas dynamics to keep the algebra as simple as possible, but it 
seems clear that they would go through (with possibly minor alterations) in 
other cases. It will be sufficient, again for simplicity, to give the details for 
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isentropic flow; the methods are not limited by this and, even when shocks 
are present, the entropy changes for weak waves do not affect the lowest 
order terms. The full equations are given in (6.132)-(6.134), and for 
isentropic flow they may be reduced to the following pair of equations in 
sound speed a and radial velocity u: 


=] 
ae A pee = 0, (9.45) 
t r > r 


u,+uu,+ τ δα =0. (9.46) 


Small Parameter Expansions. 


One obvious approach is to continue the naive expansions in small 
amplitude beyond linear theory, see what goes wrong, and correct it. It 
may now be helpful to display a small parameter ε explicitly; ε would be 
taken, for example, as the maximum value of yu / a, ON some initial surface. 
The naive expansions would then be 


u=eu,(r,t) + €7u,(r,t) + tee, 


a=daytea,(r,t)+ €7a,(r,tht+---, 


These are substituted in (9.45) and (9.46). The coefficients of successive 
powers of ε are equated to zero to obtain a hierarchy of successive pairs of 
equations for (u),a)), (u5,a;),.... Of course u, and a, are found to be the 
linear expressions given earlier, the main terms being proportional to 
F(t—r/ao)/r. The expressions for u, and a, are then found to contain 
terms in r~'logr, r~7logr, and r~2. The first of these is responsible for the 
nonuniformity, since it makes the ratios u, / u, and a,/a, tend to infinity as 
r—>oo; the others are harmless. The expressions are 


Ωρ 2 2a r 


. ᾿ Ι F(r*)F'(1* 
ead AO τό 5 ΞΡ ΘΟ certm| + 
r Aor 


- F(r* +1 F(r*)F’(r* 

2 es ΕΣ ag] + 

y-1 a r 2a r 

where τὴ is the linearized characteristic t— r/ay. Here “>, 2, denote terms 
uniformly bounded with respect to u, and αι; F(r*)=—f'(t*)/ap as 
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before, and «F now replaces the function F in (9.20) and (9.21). We see 
immediately that the appearance of the nonuniform terms may be in- 
terpreted as the consequence of an injudicious use of Taylor’s expansion 
on the expressions 


ΓΝ F(t) f(r) | 


r Apr? 


2 ἀ- ἄρ F(1) 
tal es 


yl: <a, a 
with 
+] 
T=Tt+ an eF (7) logr. 


But these expressions would just be the proposed nonlinear solution (9.20), 
(9.21), with + determined as in (9.24) and with T(r)=7. The situation is 
closely similar to the one discussed in Section 2.10. The use of Taylor’s 
theorem in reverse is a familiar one in perturbation theory. The arbitrary 
function 7(r) allows more freedom in the choice of the characteristic 
variable 7; changes in the choice of τ are absorbed by changes in the 
determination of F(r) from boundary data so the final solution is unique. 

The foregoing investigation shows that to avoid nonuniformities one 
should start with the expansions 


u=eu,(r,7) Ὁ εἰμ4(7,1) +°°°, 
(9.47) 
a=aytea,(r,7) +€7a,(7,7) +°°-, 


where 7(¢,r,€) is to be suitably chosen. Better still, one should add the 
expansion 


=t)(r,7)+et,(7,7) +°°° (9.48 ) 


to (9.47) and determine rather the function ¢(7,7,€) by choosing ¢,(7,7), 
t(r,T),..., to avoid nonuniformities in the validity. For wave problems we 
expect the latter to stem from the requirement that curves 7=constant be 
characteristic curves. (Cases other than wave problems were proposed 
using this “strained coordinate” technique by Lighthill, 1949.) Assuming in 
advance that 7 will turn out to be the characteristic variable, it is clearly 
preferable to start from the equations (9.45)-(9.46) written with 7 and r as 
independent variables and ¢(r,r) defined by 


ar_ ἡ 
Or atu 


(9.49) 
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The equations may then be written 


Zn Or ΤΕ ἘΝ ΠΞΠῚ (9.50) 
1 atur 
a,—u,— : αα, Ἔα, τυ =O, (9.51) 
yee ot Ve] a 


The form is unsymmetrical because of the mixed use of one characteristic 
variable + and the radial distance r. However, (9.50) can be recognized as 
the characteristic equation for variations along the characteristics 
T =constant. 

Equations 9.49-9.51 are now solved by the expansions 9.47—9.48. To 
lowest order in (9.49) we have 


hence 


t= = + T(r). 


The first order terms in (9.50)-(9.51) are then 
2u, 


——a,,+u,,+ — =0, 
y—1 Ir Ir r 


2 ) 
y—-1°" a a Pe alae (7) =0. 


To solve these equations it should be remembered that they must be the 
linearized equations in disguise. It is easily verified that the solution is 


uy _ F(t) _ fr) 
2 ὦ, Εἴη) 
γ--1 a me 


where f’(r)= — a2F(r)T’(r). In the next order, (9.49) gives 


Of, u,ta, 


2 
or Ωρ 


. 
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Hence 


2. 1 f(t) 
i= γα: F(r) logr aa 


These lowest terms are precisely the nonlinearized solution (9.20)- (9.23), 
thus justifying the technique and providing a consistent scheme for higher 
approximations. 


Expansions at Large Distances. 


A variant of this approach is to use expansions of u(r,7), a(7,7), (7,7), 
not in powers of a small amplitude parameter ε but in inverse powers of r 
(supplemented by logarithmic terms when necessary) whose coefficients 
are functions of τ. This is essentially the approach used in the author’s 
earlier papers (Whitham, 1950a, 1950b). 


Wavefront Expansion. 


Another approach, which is not as strongly based on expansions in ε, 
proceeds from a close analogy with the simple waves of the plane case. The 
full characteristic forms for (9.45)-(9.46) are 


Cee Pee eae ee 
[τῷ a) a \(—2 u)+ : =0, (9.52) 
a a\(2 dau _ 
[2+ (uray |(—pa+u)+ ; =0. (9.53) 


In the plane case the terms in 2au/r are absent, and for a simple wave 
(9.52) is dispensed with quickly by arguing that 


2a 
y-1 


{7 


is constant everywhere. This conclusion can no longer be reached exactly. 
However, the change in this Riemann variable will depend on the integral 


2auZ# 


Cc. r 
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shock 


Fig. 9.1. Characteristics and shock in discussion of spherical waves. 


taken along the C_. Near the head of the wave the contribution will be 
small, since the range of integration is small (see Fig. 9.1). The relative 
change in the Riemann variable due to the integral will in fact be of order 
a t /r, since τ provides an estimate of the time change from the head of the 
wave. Furthermore, the arguments so far have indicated the region ayt/r 
<1 as the one of interest. This suggests the constancy of the Riemann 
invariant as a good first approximation for this region. If we take 


2a ae 2a 
y-1 γι] 


(9.54) 


as an approximation to (9.52); the second equation (9.53) provides a single 
first order equation for μ. Its solution requires integration along the C, 
and the range of integration on the C, 15 not small. The equation for wu is 


ou y+! \ ou od Ae 
H+ (at 1-1) + (aot : “| Ξε. (9.55) 
This is almost the same as (9.35) with B=1 and can be handled similarly. 


The comparison of (9.55) with (6.83) for the plane case should also be 
noted. The exact solution is 


2/(y-)) 
u y—-lou _ F(t) 
“(14 5 | a= (9.56) 


~=— Ὁ (9.57) 
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A uniformly valid approximation is 


F(r) +1 
ΠΗ͂Σ ΞΕ ΤΟΣ Σὺ; 
eo ανΝ ποτ ἢ F(7r)logr+ T(r), 


and, from (9.54), 


2 4-4 4 Ε(τ) 
y—-1 a Ωρ ig 


This is the proposed solution (9.24). It should be noted, however, that in 
this approach as opposed to the last one we obtain only the geometrical 
acoustics form for u and a. This is perfectly satisfactory for the behavior in 
at /r<1, but it would require supplementing by other methods to deter- 
mine F(7). 

When a shock is required at the head of the wave, the jumps in the 
entropy and the Riemann invariant (9.54) are of third order in the strength 
and do not affect the lowest order approximation. 


N Wave Expansion. 


After the shocks are included, the typical asymptotic behavior of the 
final waveform at large distances is an N wave centered around a limiting 
characteristic t). For spherical waves, with details of the coefficients added 
to (9.34), we have 


2 a-aA y 269 [, 


r -ι 
-- ΞΞ ἐ ἘΞ Τ' | . (9.58 
ey πες (το | (rlogr)"'. (9.58) 


ἄρ 


This suggests that the final N wave form could be generated directly by 
looking for solutions in the form of expansions 


u=0,(r) (FS) το, )ᾳ(ς -- 2) too 
(9.59) 
α-παρτ δι()(ζ-- 2) Ἐδ,(υ)(ζ -- So) Ἐτ.-, 


where {=t—r/a, and §, refers to the asymptotically straight characteristic 
between the shocks. If these expansions are substituted in (9.45)-(9.46), the 
successive powers of (ζ -- ζο) lead to a hierarchy of equations for (v,,5,), 
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(v,,5,),.... The first pair of equations gives 


—m_ ῦ;: (9.60) 


The first equation confirms the relation between a and u. The second 
equation may be written | 


3 


+] 
4(1)+2 Leer, 
2ag ἵἴ 


which integrates to 


2αῇ ] 


(9.62) 


ΠΣ y+1 rlogr 
and confirms the dependence on r noted in (9.58). 

This gives a very simple approach to the asymptotic behavior, which is 
one of the outstanding results of the theory. It may be continued further to 
include the shock determination. If ¢—§ = G(r) at the front shock, (9.59) 
provides power series in G(r) for the flow quantities at the shock. The 
shock condition (9.28) in this case is 


Therefore 


G(r) xlog!/*r, (9.63) 


This agrees with (9.31) for spherical waves. 

Although this final method is perhaps the simplest of all, and one in 
which higher order corrections may be easily found, it cannot predict how 
the coefficients in the shock equation and shock strength depend on the 
initial source. 
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9.3 Sonic Booms 


The central problem for sonic booms is to determine the shocks 
produced by an axisymmetrical body in steady supersonic flight. The 
effects of different body shapes, acceleration, curved flight paths, and 
nonuniform atmosphere are all developed in various ways from this basic 
problem. 

For the basic problem it is convenient to take a frame of reference in 
which the flow is steady. The linearized theory has been discussed in detail 
in Section 7.5 and the nonlinearization can now proceed in close analogy 
with the techniques developed here for unsteady waves. The corresponding 
problem of plane flow treated in Section 6.17 also contributes to the 
background of ideas. | 

If U is the mainstream velocity parallel to the x axis and the perturbed 
velocity components in the x and r directions are now denoted by U(1+ u) 
and Uv, we have 


x— Br 


S”’(n)a 
ἜΝ om aoe) δῆ : (9.64) 
0 (χ-- η)΄ -- B?? 
x— Br 
δ΄ (η)ώ 
0 (x—7)’— Br? 


where B= VM2~—1 and S(x) is the cross-sectional area at a distance x 
from the nose. The disturbance is confined behind the Mach cone x — Br 
=(0, which makes the Mach angle 


| 


+ [lg = Sin (9.66) 


1 
Μ 


μα 


Fig. 9.2. Linear characteristic pattern in supersonic flow past a body. 
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Fig. 9.3. Nonlinear characteristic pattern with shocks in supersonic flow past a body. 


with the stream direction. The quantity x— Br is the linear characteristic 
variable and corresponds to ¢—r/cg, in the discussion of unsteady cylindri- 
cal waves. In the (x,r) diagram, the linear characteristics are a family of 
parallel straight lines making an angle μὴ to the x axis as shown in Fig. 9.2. 
In the region (x — Br)/ Br<1, the approximations in (7.45)-(7.47) may be 
used. This region includes the front shock and the main part of the far 
field and it is here that the nonlinear corrections are crucial. The nonlinear 
effects modify the characteristics and introduce shocks as indicated in Fig. 
9.3. Following the nonlinearization technique, we replace x — Br by &(x,r), 
where ξ is to be determined so that the curves £=constant are an adequate 
approximation to the exact characteristics. The modified expressions for 
the flow quantities are 


u=— a0) , oF ene ; (9.67) 
V2Br V2Br 
P~Po =yM? F() Be θος ey ae ἱ (9.68) 
Po V2Br ἄρ 4 V2Br 
where 
Sg” 
F(é) = ee: 5 (9.69) 


i 


A typical F curve was given in Fig. 7.3. As noted in (7.48), F-0 as ξ-90 
and the linear theory with €=x— Br does not predict a shock. The 
nonlinearization is clearly crucial here. 

The exact equations for irrotational axisymmetric flow are the same as 
for plane flow, (6.158)-(6.159), with y replaced by r and an additional term 
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—a’v/r in (6.158). Since the highest derivatives are unaffected, the 
characteristic directions are still given by θ-Ἐμ, where @ is the flow 
direction and p is the exact Mach angle defined by p=sin~'a/q. Accor- 
dingly, on €=constant, 

ax _ 

iy =cot( w+). 
Just as in the unsteady wave problems, the first order perturbation 
approximation to this is sufficient and we use 


a = COt fly — ( fb— fg + 8) cosec? py. 


To the same order 


hence 


On substitution from (9.67) and (9.68), the equation becomes 
ax 1 (y+1)M* FE) 
or B (2B) ΩΝ 

and we have 


x= Br—kF(£)r'/7 +6, (9.70) 
where 
+1)M4 
——, (9.71) 
2 /2B 3/2 


The nonlinearized solution in the region / Br<1 is given by (9.67), (9.68), 
and (9.70). 


The Shocks. 


The characteristics overlap and a shock is required when F’(g) >0. For 
a finite body, 


: “F(é)dé=0 
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as noted earlier, so there will be two such regions in general and two 
shocks. The counterpart to (9.28) is that the shock slope is the mean of the 
characteristic slopes on the two sides, and the shock determination is 
completely analogous. If a shock is described by 


x= Br—G(r), 
we have 
G(r) =kF(&,)r'/?—&, 
=kF(&,)r'/?-&, 
where 


3 (FE) + FG) }(G-&) = [ F@ ae 


For the front shock which has undisturbed flow ahead of it, F(é,)=0 and 
€, may be eliminated from the determination. Then, dropping the subscript 
on €,, we have 


SEH E)= ['FE)dE, (9.72) 


x= Br—kF(£)r'/ +, (9.73) 


for the determination of the shock. The flow quantities immediately behind 
the shock are given by (9.67) and (9.68) with &(r) determined from (9.72). 


Flow Past a Slender Cone. 


For a cone with semiangle ε, S(x)=e’x? and the F function (9.69) is 
F(é) = 2¢€7é yee 
In that case the relation (9.72) between € and r for points on the shock is 


ἐξ) Ξ 5 κεν". 


The shock equation (9.73) reduces to 


x= Br- = Ker 
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This corresponds to a conical shock with a semiangle of 


2 
+1)°M® 
it 2S ) 


°" 8 (M2-1)°? ε΄. (9.14) 


The shock strength obtained from (9.68) is 


6 
P—Po _3v(ytW)M? ἃ (9.75) 
Po 2(M = 1) 

For a cone, dimensional arguments show that the exact solution is a 
similarity solution with the flow quantities functions of r/x. The exact 
nonlinear equations may then be reduced to ordinary differential equa- 
tions and integrated numerically. This is the famous Taylor-Maccoll solu- 
tion (1933), which was a landmark in the development of the theory of 
supersonic flow. The results (9.74) and (9.75) were deduced for slender 
cones within the similarity theory by Lighthill (1948). It provides a valu- 
able check on the more general approach for slender bodies. Numerical 
results show that (9.74)}+-(9.75) are very good approximations for cones up 
to 10° semiangle over a range of Mach numbers from about 1.1 to 3.0. 

For general slender bodies, these formulas give the initial behavior of 
the shock. It should be noted that whereas the disturbances near the body 
are O(e’), the shock strength is O(e*). This explains in a sense why it is 
missed in the linear theory. 


Behavior at Large Distances for Finite Bodies. 


According to (9.72), for points on the shock é—& as roo, where 
F(é))=0. Then (9.72) is asymptotically 


1/2 


ἕ 
μ()--[3 [Ἐξ )46) vs (9.76) 
0 
The shock is asymptotic to 


gee {2x f “F(@)d\r'/*— Ep (9.77) 
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and the shock strength is 


/2 


p-p M? (2 (ἢ a 
Fo πω, Mode} 
2124 τὰ tf 
= aa? f Fae} p-3/4, (9.78) 
Y 


This is the most important formula for sonic boom work. It shows that the 
boom at the ground depends very weakly on the Mach number, depends 
on distance like r~°/4, and depends on the body shape through the factor 


K= [ [ “Ρ()αε) ῷ (9.79) 


If the length of the body is / and the ratio of maximum diameter to length 
is the thickness ratio 6, the shape factor Kx 6/3/4, For a body shape 


aoa ('-(1-7) } 0<x</, 


δὶ, l<x, 


we find K = 1.046/3/4, 
The asymptotic wave profile is a balanced N wave. Between the 
Shocks £~£, F(§)~0, so that from (9.70) 
Br-—x+&) 
F)~—, 


and from (9.68) and (9.71) the pressure ratio is 


The flow behind the rear shock is not completely undisturbed but is of 
smaller order than the disturbance in the N wave. For this and other 
details reference may be made to the original account (Whitham, 1952). 
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Extensions of the Theory. 


Axisymmetric bodies might seem a far cry from real aircraft, but it is 
known that the far flow field in any direction away from a finite body can 
be represented as the flow due to an equivalent body of revolution. That is, 
in any direction the expressions (9.67)-(9.69) apply but the F function will 
be different for different directions. In the linear theory from which one 
starts, the contributions from fuselage, wings, lift distribution, and the hke 
can be treated separately and superposed to give the final F function for 
each direction. The nonlinear expressions then apply with this F function. 
The volume contribution is related to a distribution of cross-sectional area 
S(x), where the cuts are made by planes at an angle to the stream in 
accordance with the supersonic area rule. For details of the method and 
the nonlinear results, see Whitham (1956). When various protuberances 
such as wings are included S’(x) becomes discontinuous and (9.69) must 
be modified appropriately (Whitham, 1952). 

The effects of the lift distribution are of equal importance with the 
volume effects. In the linear theory the lift distribution L(x) provides a 
contribution 


x— Br 
Oo =— Ι cosa Oe) (9.81) 
emo  ὐ (x—n) — Br? 


to the velocity potential, where ὦ is the angle of a meridian plane through 
the flight path and is measured from the downward vertical. This may be 
approximated for (x— Br)/Br<l1 as before and the perturbations are 
again given by (9.67)-(9.68) with 


δ: ξ 
F()= 1 Bcosw L'(n) dn (9.82) 


2 — ΞΞ----- : 
27 Po U ὃ Vi-n 


This is an interesting illustration of the “equivalent body” concept for 
asymmetric distributions. It should be noted that the approximations 
(9.67)(9.68) are valid for £/ Br<1 and they are sufficient to determine the 
shocks. However, the pressure distribution behind the main N wave makes 
important contributions to the total lift transmitted to the ground. The full 
form (9.81) and, when necessary, its nonlinearization are required for a 
detailed accounting of the lift. This has sometimes caused confusion in the 
literature where it has been remarked that the pressure distribution given 
by (9.68), when integrated over the ground, does not give the total lift 
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fo L(x)dx. The expression (9.68) applies only in the region of the main N 
wave. The full formula derived from (9.81) integrates correctly to give the 
total lift. 

The remaining extensions to accelerating bodies and nonuniform 
atmospheres, the latter being always important in the real situation, can be 
handled analytically to some extent and the theory leans heavily on 
geometrical acoustics (see Friedman, Kane, and Signalla, 1963, and re- 
ferences given there). Further developments and comparisons with wind 
tunnel and observational data are reviewed in a series of papers published 
in the Journal of the Acoustical Society of America (1965). Similar compari- 
sons have been made by various government laboratories and aircraft 
companies. (A popular account for laymen which contains some interest- 
ing checks of the theory with reality is presented in Boeing Document 
D6A10598-1). The conclusion seems to be that the theory provides good 
results and valuable insight in an extremely complicated practical problem. 


CHAPTER 10 


Wave Hierarchies 


The study of a single set of hyperbolic waves, including the various 
effects of geometry, diffusion, and damping, has now been covered in 
considerable detail. To complete this first part, we discuss the situation 
when waves of different orders appear in the same problem. Typical 
examples arose in Chapter 3 and some preliminary comments were made 
there. In traffic flow, for instance, the equations 


p,+ (pv), =9, 
(10.1) 
7(v,+ vv,) + peat v—V(p)=0 


were proposed at a certain level of description. This system has two 
families of characteristics and the characteristic velocities are found to be 


y Ρ 
ot V\V— 5 Ό- γ- : (10.2) 


Consequently, waves with these velocities will have their important roles to 
play. Yet the reduced equations 


p,+(pv),=0,  v=V(p), (10.3) 


which are expected to be a good approximation for sufficiently small 
values of ν and τ, have a single family of characteristics and the 
characteristic velocity is neither of the two in (10.2); it is in fact 


V(p)+pV'(p). (10.4) 


If there is to be no inconsistency between the two levels of description, 
waves with velocity (10.4) must also play an important role in the solutions 
to (10.1), even though they no longer correspond to the characteristics. The 
aim here is to clarify further the roles of the “higher order waves” (10.2) 
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and the “lower order waves” (10.4), and to see how each set is modified by 
the presence of the other. 

We consider first the linearized versions of systems like (10.1), since 
general solutions to typical problems can be obtained by transforms and 
used to exhibit the salient features explicitly. Similar analytic solutions are 
rarely available for the full nonlinear systems, but the linear results may be 
used to infer the corresponding nonlinear behavior of the various waves 
and to suggest simplifying approximations for their description. 

When systems such as (10.1) are linearized it is more convenient to 
work with the equivalent single second order equation. It takes the general 
form 


rs eee a oe 


where the coefficients are constants and, for definiteness, we choose Cy > Cp. 
In different notation, this was (3.4) for traffic flow: c, and c, are the 
linearized forms of (10.2), namely the values in the undisturbed flow, and a 
is the linearized form of (10.4). For flood waves, the nonlinear system 


(3.37) is similar to (10.1). The characteristic velocities are o+ Veh, but 
the reduced system (3.38) shows also the presence of lower order waves 
with velocity 3v/2. The linearized equation (3.41) is in the same form as 
(10.5). For the chemical exchange processes formulated in (3.74), the linear 
equation is exact and is covered by (10.5) with one of the c’s zero. Other 
examples will be mentioned later. If the systems concerned are higher than 
second order, there will be corresponding increases in the number of 
factors that make up the operators in (10.5). 

The waves of different order are clearly displayed by the factored 
operators in (10.5). Indeed if the lower order terms were absent (η = 00), 
the general solution would be - 


P=P(x- Ct) + φκγί(α -- Cy). (10.6) 


Conversely, if the higher order terms were absent (n=0), the solution 
would be 


P= φρία -- αἰ). (10.7) 


The latter corresponds, of course, to the reduced level of description whose 
linearized version is the equation 


=0. (10.8) 
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Fig. 10.1. The (x,f) diagram for the initial value problem. 


Our questions concern the combined system, the different roles played by 
the waves in the two levels, and the modifications to (10.6) and (10.7). 

We can sketch out in advance what must happen. Since the 
characteristics of (10.5) are determined by the higher order terms, the first 
signals and wavefronts must travel with the velocities c, and c,. But to fit in 
with the reduced description, some of the disturbance must travel with 
velocity a. This is indicated in the (x,/) diagram in Fig. 10.1. As the 
parameter ἡ is reduced, the first signals must become small; the main 
disturbance must be moving with velocity a and be reasonably well 
approximated by (10.7). 

This picture makes sense only if a lies between c, and c,. But as we 
saw in Chapter 3, exactly this ordering of velocities is necessary for 
stability, and the stability condition ties in nicely with the ideas on 
propagation. One is tempted to say that the instability that arises when a 
does not lie between c, and c, is a consequence of an unresolvable 
competition between the two sets of waves. 

The question of appropriate boundary conditions also arises here, 
since the number of boundary conditions is determined by the number of 
characteristics pointing into the (x,t) region of interest. Since the number 
of characteristics can change in going from (10.1) to (10.3), or from (10.5) 
to (10.8), a rationalization of this apparent disagreement is also required. 
In view of the inequality 


C)>a>c, (10.9) 


required for stability, (10.5) can only require more boundary conditions 
than (10.8). When this is the case, there will be no inconsistency in the two 
levels of description if the additional information for (10.5) affects the 
solution only in a layer next to the boundary, which is thin for small ἡ, and 
outside this layer the solution of (10.5) is well approximated by a solution 
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of (10.8). The appropriate solution of (10.8) will satisfy only some of the 
boundary data, and the remaining adjustment to the additional boundary 
data occurs in the boundary layer. 

The details of all these arguments are substantiated from complete 
solutions of (10.5). The relevant ideas can then be taken over piecemeal to 
the nonlinear situation. In the nonlinear case, the possibility of shocks 


arises and different types of shock will appear as appropriate discontinui- 


ties in different levels of description. An understanding of the relations 
between them leads to a simple criterion to predict when the shock 
structure will still contain a discontinuity. Instances of this were noted in 
the inequalities (3.17) and (3.52). We shall now be able to give a more 
general view of these and add further examples. 


10.1 Exact Solutions for the Linearized Problem 


First we check the stability requirements on (10.5). An elementary 
solution is 
p = Ae ik οι 
provided that 


n(w— ke,)(w— kes) + i(w— ka) =0. (10.10) 


For relatively short waves, kc,n>1, we have 


- ¢,-a 
ep ey, 10.11 
NC) C4 ( ) 
ὠω-΄“--Ξ 
7 a— Cy 
| Ree 0 ς 10.12 
: NC, Co ( ) 


One or the other of these has positive imaginary part showing instability 
unless 


n>0, c,>a>cp. (10.13) 


It is then easily verified that §$w<0O for all Κα under these conditions, so 
they provide the complete stability requirements. We now suppose that the 
inequalities in (10.13) are satisfied and consider more general solutions. 

The main points can be made equally well on the solution of the 
initial value problem by Fourier transforms or the solution of the signaling 
problem by Laplace transforms. The latter is chosen since it offers a richer 
number of cases depending on the signs of c,, c,, and a. 
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Case c,>a>0, c,<9. 


This is the simplest case; with c, <0, only the c, family of higher order 
waves is generated and with a>0 there is no conflict in the number of 
boundary conditions posed on x=0. For (10.5), a well-posed problem is 
then 


p=9,=0, x>0, 1=0, 
(10.14) 
p=f(t), x=0, ¢>0. 


For the reduced equation (10.8), the initial condition g,=0 would be 
dropped, but in either case the solution remains identically zero in x >0 
for a finite time so the difference does not show up. The solution of (10.5) 
by Laplace transforms may be taken to be 


D( p,x)e?! 
w(ant)= zu; {ἘΞ ὦ t>0, (10.15) 
B® 


where & is a Bromwich path {p=constant to the right of all the 
singularities of the integrand in the complex p plane. On substitution in 
(10.5), we have 

NC C2P,x + {nc + c2)p + a}, + P(np + Ip =O, 


and the possible solutions for ᾧ are 


= F(p)e*?) + G(p)e*?2™, (10.16) 
where P,, P, are the roots of 
NC\CoP*+ {n(c, + c,)p + a}P+p(np + 1)=0, (10.17) 
and F, G are arbitrary functions. For large p, 
Ρ 


In this first case, with c,>0, c,<0, the second term in (10.16) is un- 
bounded for large Rp, so we must take G(p)=0; this term would 
correspond to incoming waves with velocity c,<0 and is excluded. The 
remaj .ing function F(p) is determined from the single boundary condition 
gm=/ δὴ at x=0. In fact the requirement is simply that F(p) be the Laplace 
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transform of f(t). The final solution therefore is 


F 
p= ae AP) eps r.dp (10.18) 
B 


where 


F(p)=pf “fe” dt, 


ΓΕ a we ται 
φῇ 


(10.19) 
2πὶ Ρ 


and P,(p) is the root of (10.17), which is asymptotic to —p/c, as p00. 

When ¢—x/c,<0, the contour can be closed by a large semicircle to 
the right to show that φεῦ. Thus the wavefront is x — c,t=0. The behavior 
of » near the wavefront is obtained from the more detailed asymptotic 
behavior of the integrand of (10.18) as poo. If the contour % is taken far 
enough to the right, we may substitute the expansion 


in (10.18) and deduce the approximation 


φ-ήι- = )exp| - eee |. (10.20) 


Cy— Cy Cy 


This is in fact the first term in the geometrical optics expansion (cf. Section 
7.7); further terms in the series are obtained by continuing the expansion 
of e* for large p. The general form can be found by substitution of the 
geometrical optics expansion directly in (10.5), but (10.20) also relates the 
function of t—x/c, to the boundary condition. The expression (10.20) is 
valid near the wavefront. It shows that the first disturbance propagates 
with the c, waves, but this disturbance damps out exponentially and 
becomes negligible in a distance of order c,n. As n->0, this disturbance 
becomes negligible for all x >0, in agreement with the reduced description. 

We ask, then, where the main disturbance described by (10.18) is to be 
found. To obtain this information, we investigate the behavior on the 
family of lines x /t=constant in the (x,t) plane, since each one of these 
represents the path of a wave moving with constant velocity. We shall need 
to be reasonably careful about the various limits involved and, accord- 
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ingly, introduce nondimensional quantities 


Xx 
q=np, Q(q)=nc,P\(P), ara 


In general, the boundary function f(t) will introduce another time scale T, 
say, and F(p) will take the form 


T 
F(p)= aC a ) 
Then (10.18) becomes 


(qT 
P= al El ela mOt/Ndg, (10.21) 
πὶ )ς 4 


where Ο(4) is the appropriate root of 
c ς 
-Ξρ3- [{:- 2 q+ a orate 1) =0. 
Cy Cy Cy 


We now consider the asymptotic behavior of (10.21) as t/n-—0o, with m 
fixed. According to the saddle point method, the dominant contribution 
comes from the neighborhood of the point g= q* for which 


d ΜΕ 
that is, 
1+mQ’(q*) =0. (10.22) 


The first term of the asymptotic expansion is found by deforming the 
contour into the path of steepest descent ὁ through g=q* and expanding 
g+m0O as far as quadratic terms in 4 -- q*. Thus we have 


pene“ (ot+ moCo")) 


$ (qT 
x = i “ne exp| 5 mo" (ar(a-a"Y ha (10.23) 


as 1 / NO. 
In the usual application of the saddle point method, the remaining 
part of the integrand would also be expanded in Taylor series about g=q* 
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and &(qT/7)/q would be replaced by ¥(q*7T/n)/q*. This further step 
would be valid for the limit t/n->00, T/7 held fixed, and is relevant when 
t>>n, t>>T. But we are interested in the case t>n, T>n, independent of 
the magnitude of t/T. To include this case, the possibility T/n->0o must 
be allowed in (10.23) and the more general form must be retained. 

In discussing the behavior of (10.23), it is convenient to revert to the 
original variables. We have 


F(p) 


Ι ae 
p~exp{ip* + xP,(p*)} 7 — [ exp| 5 ΧΡΙ(»" -»"}}Φ, (10.24) 
ΘΟ 


where p* is the function of x and ¢ determined by 


t+ xP;{(p*)=0; (10.25) 


this provides the asymptotic behavior of φ as t/n->00 keeping x / c,t fixed. 
For simplicity we shall assume that /(°f(t)dt is convergent so that F(p) /p 
is finite as pO and there is no pole. [The case in which f(t) approaches a 
constant as too is of interest, but this is most easily handled by reformu- 
lating the problem in terms of g,.] The asymptotic expression (10.24) is 
itself dominated by the exponential factor outside the integral. The ex- 
ponent is stationary when 


0 
3x {ΠΡ +xPi(p*) } =0, 


and, in view of (10.25) which determines p*(x,t), this condition reduces to 
P,(p*) =0. 


From (10.17), P,(p*)=0 must correspond to either p*=0 or p*= — 1/n, 
and it is a simple matter to check that p*=0 is the correct choice for P). 
Therefore the exponential factor in (10.24) has a stationary value, in fact a 
local maximum, for those x and ¢ which give p*=O as the solution of 
(10.25). Thus the maximum is found on 


t+ P/(0)x=0. 


One checks from (10.17) that P;(0)=—a~!. Hence the maximum of the 
exponential factor is to be found on 


x=at, 
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and the maximum value is one. The disturbance is exponentially small (in 
this limit) except in the neighborhood of x=at; this result shows that the 
main part of the disturbance eventually travels with velocity a. Since the 
approximation is for t>>n, the result applies increasingly earlier as n—0. 

We can extract further information on the behavior of the main 
disturbance. In the neighborhood of x = αἱ, the corresponding values of p* 
are small. The details of the disturbance may then be obtained by making 
further expansions of (10.24) about p*=0. But we would then have made 
an approximation to (10.18)in two stages: first an expansion of pt+ P,(p)* 
about p=p* and then an expansion of the resulting expression about 
p*=0. Obviously, the final result must be included by simply expanding 
pt+ P,(p)x about p=0. We have - 


OT Ξε τς + 
Hence 
es A) " | (:- Ὁ). ἘπΘ ἐΕ ΘπῚῳ (10.26) 
Θ 


applies in the neighborhood of x -- at=0 as t/n—>00. The first approxima- 


tion is just 
F(p) 
ePtt— x/a) gy -ή:- ~), 
P~ ay Ρ a 


which is exactly the prediction of the lower order equation 10.8. Thus the 
lower order formulation is shown to give a correct description of the main 
disturbance. 

To see the effect of the quadratic term in the exponential in (10.26), it 
is more instructive to find the equation satisfied by (10.26) rather than to 
intepret the integral. The expression is, in fact, identical with the solution 
of the equation 


n(c,—a)(a-¢) 


Pit 
az 


q, + ag, = ( 10.27) 
for the same boundary condition = f(t) on x =0. The right side of (10.27) 
is already a small correction (of order n/t compared with the other terms), 
50 it is consistent to use the first approximation.d /dt= — a(d/0x) in it and 
to take an equivalent form 


φ, + ay, = nC, — a)(a— C2), (10.28) 
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This is more familiar; it shows that the main part of the disturbance 
propagates with velocity a and is diffused by the effects of the higher order 
terms in the equation. But the latter effect is small when 7 is small. To 
Summarize then in the case c, >a>0, c,<0: The first signals propagate 
out with velocity c, but damp out as shown in (10.20). The main distur- 
bance lags behind and moves with the lower order wave speed a. In this 
case there is no disagreement in the number of boundary conditions to be 
prescribed at x =0; p=/(‘) is appropriate for both (10.5) and (10.8). After 
a time of order ἡ, the first signals are exponentially small and the main 
part of the solution to (10.5) is well described by (10.8) using the same 
boundary condition at x=0. The effect of the higher order terms is to 
produce a diffusion of the lower order waves as shown by (10.28) but this 
is small when ἡ is appropriately small. 


Case c, >0, c,<a<0. 


In this case the maximum value of the exponential factor in (10.24) 
still occurs at x=at, but with a<0 this is outside the region x >0. The 
expression (10.24) is exponentially small throughout the region x >0. The 
saddle point formula does not apply at x=0, since from (10.25) there is 
clearly no saddle point, but it is easily shown from (10.18) that the solution 
falls exponentially from the value p= f(t) on x =0, and the disturbance is 
confined to a boundary layer of thickness 7 7 1. In this case the first signal 
decays exponentially and the main disturbance does not propagate. 

The reduced equation (10.8) does not permit data to be specified at 
x =0, and its solution is p=0. This agrees with the preceding description 
outside the boundary layer, and the difference in boundary conditions is 
accommodated by the boundary layer. 


Case c,>a>c,>0. 


In this case both characteristics of (10.5) point into the region x >0, 
and it is appropriate to give the two conditions 


p=f(t), 9,=a8(t) forx=0, ἐὺ, (10.29) 


However, we note that only one of these or possibly a combination of 
them could be posed with (10.8). The two conditions (10.29) correspond 
precisely to the fact that with c, and c, positive, both terms in (10.16) must 
be retained and there are two arbitrary functions to be determined. If f(p) 
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and g(p) are the transforms of f(t) and g(t), then the arbitrary functions in 
(10.16) are determined by 


F+G=f, P,F+P,G=2. (10.30) 


The discussion of the term (10.18) in the solution is exactly the same 
as before with the same conclusion that the first signals travel with velocity 
c, but are damped out; the main disturbance travels with velocity a and is 
diffused by the higher order effects. The main disturbance is again well 
described by (10.8) and the only new question concerns which boundary 
condition is in fact adopted. The function F(p) which appears in the 
corresponding solution (10.26) is obtained from (10.30) as 


P,f-& 
P,— P, 


F= (10.31) 


But when (10.26) applies, P, is approximated for small values of np, and 
P, must be approximated in the same way. It is easily seen from (10.17) 
that 


NP a 
ΡΞ -- τὴν + O(n’p’), η,.--- Pas + O(np); 
152 


hence (10.31) reduces to 
F=f 


in this approximation. Therefore the boundary condition gp = /f(¢) is in fact 
satisfied and must be used with the reduced equation. 
The second term in the complete solution is 


G 
= AP ere Mey (10.32) 
B 


Since P,~ — p/c, as p>, this expression is zero for x > cyt, correspond- 
ing to the second wavefront provided by the waves with velocity ὁ. It 15 
easily shown, as in the earlier case, that these waves are damped exponen- 
tially and become negligible when x/c,n>1. The saddle point investiga- 
tion then shows that the contribution of (10.32) is small except near x =0. 
Near x =0, we may use the asymptotic expansion for 


ee. —~_ fixed 
(γῆ 
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to see how the solution behaves. According to Watson’s lemma, this is 
found from the behavior of the integrand in (10.32) for small np. We have 


Py»~- 


~, Px\CyCo7 
ΠΟ ~-( gue | 
ἜΤ (p) g af 


Therefore (10.32) is asymptotic to 


τ nexp( — ἐν (10.33) 


Ο1Ο) ἢ 


l , 
-{e(+ir(n} 
Thus the first contribution from (10.32) travels with the c, waves but 
damps out and its main contribution is the boundary layer given by 
(10.33). | 

A composite solution for t/7>>1 is obtained by adding the two main 
contributions to give 
C102 


φεήι- Σ)- [εκ τ ὩΣ Ὸ} "τ ἠποχρί- 2 =) (10.34) 


This satisfies both boundary conditions to the first order. The second term 
is needed to satisfy the second boundary condition, but it decays rapidly 
away from the boundary in a layer whose thickness is of order NC\C,/ a. 
The various results are conveniently represented in an (x,t) diagram as 
shown in Fig. 10.2. 


boundary 


layer 


Fig. 10.2. The (x,f) diagram for the signaling problem. 
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10.2 Simplified Approaches 


Other linear equations with constant coefficients can alwa s be 
treated in similar fashion by transforms and suitable asymptotic ex- 
pansions. However, the details are tedious and the main ingredients in any 
solution can be seen very simply, and with greater insight, by more 
intuitive arguments. We indicate the techniques on the previous problem. 

First, in any wave profile moving approximately with speed V, the ὦ 
and x derivatives are related approximately by 


(10.35) 


We can use this in (10.5) to examine the waves moving with velocities 
C\,C>,a, in turn. For c, waves, we use 0/dt=—c, 9/0x in all derivatives 
except the sensitive one where the factor 0/dt+c, 0/90x itself appears. We 
have 


ὃ (9 ὃ 0 
η(ο,-- ον a [Ὁ τοι δι |p (a— 0) τς = 


The 0/0x operator can be integrated out without loss since it corresponds 
to the remnants of the other waves. Accordingly we take 


oy ὃφ ογπτα 

a> Ἔσ,--Ἐπ---------φτῦ. 10.36 
δι 1 Ax m(¢;—¢) 
The solution is exactly the one found in (10.20). Similarly for the c, waves, 
we have 


the solution is similar in form to (10.20) and may be verified in detail from 
(10.32). 

For the lower order waves propagating with velocity a, we use 
d/dt=—a 0/dx in the second order terms in (10.5) and the approximate 
form is 


φ, + ay, = (Cc, — a)(a— “)φ,.- (10.37) 


This in exact agreement with (10.28). If the ¢ derivatives are preferred in 
the second order term we obtain the alternative form (10.27). 
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The possibility of boundary layers near x =O may be investigated in 
the same spirit by arguing that x derivatives will be much larger than ἢ 
derivatives so that the approximation 


— > (10.38) 


should be made. This may be interpreted as the special case of (10.35) with 
V =0; it corresponds to nonpropagating waves. Under this approximation, 
(10.5) is reduced to 


NC {CoP + AP, =O (10.39) 
and the general solution 1s 


p= A(t) + B(t)exp(- =), (10.40) 
Ο10 ἢ 
in agreement with (10.34). Of course the exponential solution is omitted 
unless a/c,c,n>0 and it is only in the case of exponential decrease that 
the existence of a boundary layer is inferred. 

For initial layers, where data for m and q, are imposed at t=O in the 
full equation, we consider the form of the equation under the approxima- 
tion 0/dt>0/0dx. For (10.5) we have 


Nut G=0, p=C(x)+D(x)e/". 


This shows how the solution adjusts to the approximate form where only 
the initial value of φ 15 relevant. 

These techniques allow a quick assessment of the various regions of 
interest and of the relevant approximations. They are easily made the basis 
of more formal perturbation procedures. For example, the straightforward 
expansion 


P= Yo( x,t) + ηφι|(χ,1) + n°, (x,t) ++ °° 
gives (10.8) for φρ; the expansion 
P= φρίξ,1) +0'p, (Et) Ἐ τ... 
ἔπη "(x= αἱ) 
leads to (10.28) for φρ; the expansion 
P=H(X,1) +09 (Xt) Ἐ τ... 
X= 'x 


leads to the boundary layer equation (10.39) for @. 
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10.3 Higher Order Systems, Nonlinear Effects, and Shocks 


For a nonlinear system of equations which exhibits plane waves of 
different orders, we do not normally have the luxury of complete exact 
solutions and for analytic work have to rely on the counterparts of the 
approximations in the last section. Just how to proceed in detail will vary 
from problem to problem but we may note a few guidelines. For ease of 
reference suppose we call the complete set of equations the system I and 
the reduced set obtained by setting some parameter ἡ equal to zero the 
system II. The theory of characteristics will provide the characteristic 
velocities c,,..., c, for I, and characteristic velocities a,,...,a,, (m<n) for 
II. In general, for nonlinear problems these will be functions of the 
dependent variables. However, the linearized theory for small perturba- 
tions about some uniform state will be useful to set the scene and give 
information about stability. If just two orders are present the linearized 
theory for plane waves in a uniform medium may be reduced to a single 
equation 


3 ὃ 9 ὃ 3 ὃ ὃ ΕΒ ΤῊΞ 
(10.41) 


for some perturbation potential φ, where the propagation speeds now take 
their constant values in the uniform state. The standard stability argument 
turns up the interesting result that the orders must satisfy either m=n— Ἰ 
or m=n-—2 for stability. In the first case the complete requirements are 


m=n-l, 7 20, Cp > a, > Cy > An°** Oa, 1 > C. (10.42) 


These are just the conditions that allow a satisfactory interpretation of how 
the solutions to II approximate to the full set I. The second case, m=n — 2, 
introduces effects more typical of dispersive waves and its discussion is 
relegated to the second part of the book. (The stability conditions for this 
case were given by Wu, 1961, correcting an earlier misstatement by the 
author that (10.42) was the only case to consider.) 

Equation 10.41 may be solved by transforms, but the overall picture 
may be outlined by the techniques of the last section. Each of the c,; waves 
satisfies an approximate equation 


n(p, + ¢P,) + Ύ φ = 9, (10.43) 


where y, is determined in terms of the a’s and c’s similar to (10.36). The 
stability conditions ensure that y,>0, so we have exponential decay. 
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Similarly, the a, waves satisfy an approximate equation 
φ, + ἀφ, = ηα;φ,.: (10.44) 


analogous to (10.28), and a, >0. 
When nonlinear effects are included, we should aim toward an equa- 


tion of the form 
φ, + ο(φ)φ, + Bp =0 (10.45) 


to replace (10.43). Within the waves of any order there is now nontrivial 
interaction between the different waves and extracting (10.45) would 
require something akin to the “simple wave argument.” In nonlinear 
problems it would normally be more convenient to work with n dependent 
variables and a system of first order equations. The technique of using 
d/dt~—c,d/dx for c; waves is roughly equivalent to taking the (n—1) 
Riemann variables of the other (n— 1) c waves to be constants. 

Equation 10.45 can then be solved by the methods of Chapter 2. 
Waves carrying an increase of ο(φ) may break [see the discussion of (2.72)] 
and shocks will be required. These shocks are discontinuities of the system! 
and the shock conditions are deduced in the standard manner from the 
relevant system of conservation equations. For 8, >0, even when shocks 
are produced the disturbance normally decays and the main disturbance is 
eventually carried by the lower order waves. 

If 7 =0, a nonlinear simple wave on the a, characteristics will satisfy 
an equation of the form | 


φ, Ἔα (φ)φ, =. (10.46) 


Shocks required in the solution of this equation will satisfy the discontinu- 
ity conditions derived from the reduced system II and will be different 
from the shock conditions for system I. | 

When higher order diffusive effects are introduced one should aim for 
an equation ; 


φ, + OP) Px = Yi; Prax (10.47) 


corresponding to (10.37). This can be related to Burgers’ equation for a,, 
within the same type of approximation, and analyzed from the results of 
Chapter 4. 
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10.4 Shock Structure 


A shock of system II, let us call it an S,,, will be smoothed out to some 
extent when viewed within system I. It is just the shock structure problem. 
However, we can now comment more decisively on the occurrence of 
discontinuities in the structure. An S,, will always be associated with one 
particular family of a waves, and each family of a waves is sandwiched 
between two families of c waves. But the S,, shock will go faster than the a 
waves ahead of it and slower than the a waves behind it. Even in a stable 
situation, it may overtake the next c wave ahead or lag behind the c wave 
following. This would violate the characteristic property of the c waves if 
the solution remains continuous. But S, discontinuities can do this. There- 
fore, whenever this situation occurs, the S$), profile will require discon- 
tinuities in the shock structure. These discontinuities are S,’s satisfying the 
S, jump conditions, and we may regard the complete profile as a combined 
S,-S}, wave. 

If the velocity of an S,, associated with the a, family is U,, and if 
superscripts 1 and 2 now denote the values of quantities ahead and behind 
it, the criterion for a continuous shock structure is 


ἐ <U,<c}. (10.48) 


If this is violated, there will be a discontinuity in the profile, and this may 
be the case even though the states on the two sides are stable. If we 
consider a nonlinear version of Fig. 10.1, the main disturbance remains 
continuous provided its velocity is well away from the higher order wave 
velocities on the two sides. 

In the nonlinear case all these velocities have a certain range of values, 
50 it is possible for the main wave to combine with the c waves ahead or 
behind. When this happens the exponential decay of the c waves ceases 
and, since the c waves break, an S, shock is picked up in the shock 
structure. The criterion (10.48) on the velocities gives a very simple method 
of prediction which avoids the much more involved analysis of the integral 
curves for the shock structure equations. Its use will be noted in the 
following examples. 


10.5 Examples 


Flood Waves. 


This case was studied in great detail in Chapter 3. We note that, with 
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the current notation, 


c=otVegh, C=o0+Vagh, 


a= 


- 


According to (10.42) a uniform flow h=hy,v=vg is stable (as noted in 
Section 3.2) provided 


; 300 - 
Up— Vg'ho Se POF gho , 


the lower limit being no restriction. According to (10.48) the S|, shock 
structure will be continuous provided 


pi? — gh <U<v) 4yg/n 


The δ᾽, shock conditions show that U>v, so the lower inequality is 
always satisfied. The criterion for the appearance of a discontinuity in the 


Sj, structure is 
U>v? +yg'h® . 


This is precisely the result found in (3.52) from the detailed analysis. 


Magnetogasdynamics. 


The equations for magnetogasdynamic waves are given in Examples 
10 and 10’ of Section 5.2. The first set is the system I and we have 


- 172 
C=(€Eou) ©, Co=uta, cz=U, 


C4=U— a, ὃς — (€gp4) 


The second set is the system II and 
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The genuine waves moving relative to the fluid have alternating velocities 
as required by (10.42) and are stable. The confluence of a,,a3, with c; on 
the particle paths is easily checked to be a stable situation. 

An Sj, shock of the a, family moving with velocity U has a continuous 
structure provided 


u +a <U< (equ) 
The speed of light is effectively infinite, so we deduce that a discontinuity 
appears at the back of the profile when 
U <u + a, 


This is a simple derivation of the result found by Marshall (1955) by a 
detailed analysis of the shock structure. Further discussion of this case may 
be found in Whitham (1959a). 


Relaxation Effects in Gases. 


The equations of inviscid gas dynamics (Chapter 6) may be written 


p, + up, + pu, =9, 


u, + uu, + τι, 


e +ue,+—u,=0. 
ρ 


During rapid changes in the flow the internal energy e may lag behind the 
equilibrium value corresponding to the ambient pressure and density. The 
translational energy adjusts quickly, but the rotational and vibrational 
energy may take an order of magnitude longer. If we suppose that a of the 
degrees of freedom adjust instantaneously but a further a, degrees of 
freedom take longer to relax, we may take 


ἐπ +E, 


where E is the energy in the lagging degrees of freedom. Jn equilibrium [see 
(6.42)} E would have the value 
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A simple overall equation to represent the relaxation is 


a, p 
Ε, ἘΜΙΕ, Ξ --τ Ε--τ- — |; 
2 p 


where 7 is the relaxation time. After some minor manipulation, the set of 
equations may be written 
p, + up, + pu, =0, 


Ϊ 
u, + uu, + po 
} P 
© (p,+ up.) — (1+ 5) 5 (0, +up,) +p E, + uE,) =0, 


Qa 
E,+uB,+1( E- “ ᾿ =0. 
p 
The characteristic velocities are 
Cc, =urt a, Cr = C3, = 4, C4=U— Ay, 


where a, is the “frozen” sound speed defined by 
2 2 \? __?P 

ajp=(1+—]-—=y,-. 
7 ( alp “Ip 


This is the system I for this case. However, if the relaxation time 7 is 
taken to be so short that E=(a,/2)(p/p) is an adequate approximation to 
the last equation of the set, we have the equilibrium theory: 


p, + up, + pu, =0, 
Ϊ 
u, + uu, + πο 


ata, 


at a, P 
(p.+ wp.) — (1+ 5 JE (o,+ up.) =0. 


2 


This is the reduced system II for our problem. The characteristics are 


a,=ut+a, =u, a,=u-4,, 


where a, is the equilibrium sound speed defined by 
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Since y,<y,, the various velocities alternate and there is stability; again 
confluence of velocities with the particle velocity is stable. 

Viewed from the full system the S,, shock structure, which takes the 
flow between two uniform states in equilibrium, will be continuous if 


u? — a <U<u + a. (10.49) 


Since the S,, shock conditions show that U >u®, only the upper restric- 
tion arises. A frozen S, shock will appear at the front of the profile and will 
be followed by a continuous relaxation region when 


U>u) + ap”. 


This criterion may be written as 


(1) 1/2 
U-u"™ yu? cts -( 4 


(10.50) 
Ye 


«(Ὁ 

For a diatomic molecule the two rotational modes may lag behind the 
three translation modes, and we may describe this by taking a=3, a,=2._ 
The frozen and equilibrium sound speeds are given by 


The criterion (10.50) predicts a fully relaxed smooth profile for 
M <1.091, 


and a discontinuity followed by a relaxation region when M exceeds this 
critical value. The discontinuity would itself be resolved into a thin 
sublayer by the inclusion of viscosity and heat conduction. 

Griffith, Brickl, and Blackman (1956) and Griffith and Kenny (1957) 
report on experimental observations for vibrational relaxations in CQ,. 
The appropriate choice for α is 5 to include translational and rotational 
modes. The vibrational modes take considerably longer to adjust. At 
300°K, the appropriate choice* for a, is 2. The critical value for M is 
1.043, and the experimental observations in the references cited bear this 
out very accurately. (Further details may be found in the above papers and 
in the excellent article by Lighthill, 1956.) 

We see that a careful assessment of the waves involved and the roles 
they must play leads to relatively simple predictions of important 
phenomena in quite complicated situations. 


*At this temperature the four modes have only one half of their classical energies so that 
a,=2 15 appropriate. 


Part II 


DISPERSIVE WAVES 


CHAPTER 11 


Linear Dispersive Waves 


The discussion in Part I was concerned primarily with hyperbolic 
systems. Most wave motions, including the familiar one of water waves, 
are not described by hyperbolic equations in the first instance. At a later 
stage there is some link-up with hyperbolic equations in describing the 
propagation of important average quantities associated with the distur- 
bance. But a different set of basic ideas and mathematical techniques must 
be developed. 

These nonhyperbolic wave motions can be grouped largely into a 
second main class which we call dispersive. In general the classification is 
less precise than that for hyperbolic waves, since it is made on the type of 
solution rather than on the equations themselves. But it may be made 
precise in a restricted class of problems, and one can make extensions in a 
natural way or proceed by analogy. It should be added that a few special 
equations exhibit both hyperbolic and dispersive behavior, the different 
behaviors appearing in different regions of the solution. But these are 
exceptional. 

The first two chapters develop the general ideas for linear systems. 
Chapter 13 is devoted to water waves, a subject which is fascinating for its 
own sake as well as being the subject in which many of the ideas of 
dispersive waves were first developed. In Chapter 13 a start is made on 
nonlinear dispersive waves in this specific context to serve as background 
for the general developments of nonlinear theory in Chapters 14 and 15; 
Chapter 16 contains miscellaneous applications. The final chapter covers 
the recent work on solitary waves and special equations. 


11.1 Dispersion Relations 


For linear problems, dispersive waves usually are recognized by the 
existence of elementary solutions in the form of sinusoidal wavetrains 


(x, t) = Aem*— ie", (11.1) 
363 
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where « is the wave number, ὦ is the frequency, and A is the amplitude. In 
the elementary solution, x,w,A are constants. Since the equations are 
linear, A factors out and is arbitrary. But to satisfy the equations, « and w 
have to be related by an equation 


G(w,«) =0. 


The function G is determined by the particular equations of the problem. 
For example, if φ satisfies the beam equation 


2 = 
Put ἜΝ τ 0, 
we require 


w? — y*«*=0. 


The relation between w and κ is called the dispersion relation, and as will 
become evident below, we can dispense with the equations once we know 
the dispersion relation; conversely, we can construct the equation from the 
dispersion relation. 

We assume that the dispersion relation may be solved in the form of 
real roots 


w= W(t). (11.2) 


There will be a number of such solutions, in general, with different 
functions W(«). We refer to these as different modes. The beam equation, 
for example, allows two modes: 


w= yK?, w= — γκῶ. 
For the present we study one mode; in linear problems the modes can be 
superposed to make up the complete solution. The linearity also allows us 
to work with the complex form (11.1) with the understanding that the real 
part should be taken when necessary. The actual solution is 


Rp=|Alcos(ex—wt+n),  n=arga. 
The quantity 
6=K:x—wt (11.3) 


is the phase; it determines the position on the cycle between a crest, where 
Ug is maximum, and a trough, where “q is a minimum. In this plane 
wave solution, phase surfaces = constant are parallel planes. The gradient 
of θ in space is the wave number «, whose direction is normal to the planes 
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and whose magnitude x is the average number of crests per 27 units of 
distance in that direction. Similarly, —@, is the frequency w, the average 
number of crests per 27 units of time. (The normalization to 27 units is 
convenient in working with the trigonometric functions.) The wavelength is 
A\=22/« and the period is T=27/w. 

The wave motion is recognized from (11.3). Any particular phase 
surface is moving with normal velocity w/« in the direction of κ. We 
therefore introduce the phase velocity, 


aches 
c= a, (11.4) 


where « is the unit vector in the « direction. For any particular mode 
w= W(k), the phase velocity is a function of x. For the wave equation 
φ,, Ξ εὐ νῳ, the dispersion relation gives w= +cyk and c= +c,: the phase 
velocity agrees with the usual propagation speed. In general, c is not 
independent of «x. Different wave numbers will lead to different phase 
speeds. This accounts for the term “dispersion.” In a Fourier synthesis of 
more general solutions by superposition, components with different wave 
numbers disperse as time goes on. This crucial process is discussed in 
detail in the next section. 

As regards classification, we must exclude the case c=constant from 
the class of dispersive waves, since the dispersion would be absent in that 
case. It is also clear that the solutions (11.2) must be real for these effects. 
The heat equation y,=V’p has solutions (11.1) with w=— ix”, but the 
solutions are not wavelike. To eliminate the unwanted cases, we restrict the 
term dispersive, in the first instance, to those cases for which 


W(«) is real, and 


£0. 
For one dimensional problems, the second condition is just 


W(x) #0. 


This is slightly stronger than c'(k) £0, since it eliminates W=ax+b as 
well. The reason for this is amplified below, but for the one dimensional 
case we note in advance that the group velocity W’(x) is the more 
important propagation velocity and the condition W(x) #0 ensures that it 
is not a constant. We can see directly that the excluded case W=ak + ὁ is 


(11.5) 
ὃ 
OK, OK; 


eee | 


determinant 
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not really dispersive. The elementary solution is 


en ibt, ix(x— at) 


and Fourier superposition gives the general solution 
e f(x -- αἱ). 


An initial waveform f(x) is modified as it propagates but there is no 
dispersion. One can easily show that the governing equation is hyperbolic. 

The determinant in (11.5) may vanish near some special value of κ, for 
example, as κ--ῦ or x—> 00, and these limiting values will require special 
consideration as singularities in the general formulas. 


Examples. 


Some typical examples which will be used as illustration in the 
development of the general theory are: 


φ,- 0° V*p+ B*p=0, w= +Ya7x?+ PB? ; (11.6) 


9,-C2Vp=B?Vy,, w= +——, (11.7) 
V1+ Bx? 

Pn + Y Prax = 0, w= + γκῦ; (11.8) 

φ, + ag, + BY, =9, = ak — Br?. (11.9) 


The first of these is hyperbolic but has dispersive solutions satisfying 
(11.5) nevertheless. It represents vibrations for a displacement g with 
additional restoring force proportional to g; it is also the Klein-Gordon 
equation of quantum theory. The other equations are not hyperbolic, and 
these are the more typical cases. Equation 11.7 appears in elasticity for 
longitudinal waves in bars, in water waves in the Boussinesq approxima- 
tion for long waves, and in plasma waves. Equation 11.8 is the equation for 
flexural vibrations of a beam. Equation 11.9 also applies to long water 
waves; it is the linearized form of the Korteweg-deVries equation. The 
water wave approximations will be investigated in detail later; the others 
are taken to be reasonably familiar. 
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Correspondence Between Equation and Dispersion Relation. 


It is obvious from these examples, and clearly true in general, that 
equations with real coefficients will lead to real dispersion relations only if 
they consist entirely of even derivatives or they consist entirely of odd 
derivatives. Each differentiation brings out a factor i. Even derivatives will 
lead to a real coefficient, odd derivatives to a pure imaginary coefficient; 
they cannot be mixed if the final form is to be real. Schrédinger’s equation, 


0 2 
ΠΩ RL (11.10) 


with mixed odd and even derivatives, has a real dispersion relation 


_ Wx? 


hw = en 


by allowing a complex coefficient. 

We can take the correspondence between the equation and the disper- 
sion relation much further. A single linear equation with constant 
coefficients may be written 


d 0 90 9 
ς εὐ ἐσ QO: 
P| dt’ dx,’ Ox,’ OX; \s 
where P is a polynomial. When the elementary solution (11.1) is substi- 


tuted in the equation, each 0/odr will produce ἃ factor — iw, and each 
d/ 0x, produces a factor ix, The dispersion relation must be 


P( — iw, ik, ik, ik,) Ξ50, (11.11) 


and we have a direct correspondence between the equation and the 
dispersion relation through the correspondence 
—<>— 10), — <>IK;. 


OL Ox, J 


J 


From (11.11) we can recover the equation. This is the basis for the earlier 
remark that the equation can be dropped when the dispersion relation 15 
known. 

It is seen, however, that an equation of this type can yield only 
polynomial dispersion relations. A natural question to ask is what kind of 
operators yield more general dispersion relations. One possibility is that 
the oscillatory wave motion represented by (11.1) takes place in only some 
of the space coordinates while there is a more complicated behavior in the 
remaining coordinates. A typical example is the theory of deep water 
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waves in which waves propagate horizontally, and the dependence on the 
vertical coordinate is not oscillatory; this will be seen later. A second 
possibility, which has wavelike behavior in all the variables, can be 
illustrated in one dimension by the integrodifferential equation 


0 ἧς 0 
με [-κα-τθγείξηαετο, (11.12) 


where the kernel K(x) is a given function. This equation has elementary 
solutions g = Ae'*~' provided 


- ine’ + [ K(x—&)ixe™dé=0. 
The condition can be rearranged as 
a wW a = — ἱκξ 
c= 4 J ke dé. (11.13) 


The right hand side is the Fourier transform of the given kernel K(x) and, 
by the inversion theorem, we have 


K(x)= ig J «(εἶπάς (11.14) 


Thus (11.12) can be constructed to give any desired c(k) and consequently 
any desired dispersion function: simply choose K (x) as the Fourier trans- 
form (11.14) of the desired phase velocity c(«). In particular, if 


then 


K(x) = ερδ(.) -- οὐδ΄ (Χ) --- +(- 1)"c,,6 2x), 


and (11.12) reduces to the differential equation 


dp dg ὃ 3p we ita 
ὭΣ ἐῶ τα δ ἃ, Com ama "Ὁ" 


When c(k) is a more general function with an infinite Taylor series in 
powers of κ, we can take the corresponding differential equation with an 
infinite series of derivatives. This is effectively summed by (11.12). 
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Definition of Dispersive Waves. 


We can now introduce a restricted definition of dispersive linear 
systems as those for which there are solutions (11.1), (11.2), with (11.5) 
satisfied. There is some overlap with hyperbolic systems as the example 
(11.6) illustrates, but the system is usually not hyperbolic. We need not 
confine the considerations to differential equations, as the last paragraph 
shows. 

It is immediately clear that the definition is too restrictive. Even for 
linear differential equations, it is limited to constant coefficients. For 
example, if y is a function of x in the beam equation, that is, 


Pu + Y?(X) Pexxx = 0, 


(11.1) is not a solution. Yet unless y(x) is a particularly violent function of 
x, we would expect the solution to have many features similar to the case 
of constant y; in some sense the structure is the same. We would think of 
this as a problem of dispersive waves in a nonuniform medium. Again, an 
equation may have separable solutions, say, 


X(«x)e~', w= W(Kk), 


where X is an oscillatory function such as a Bessel function. This would be 
dispersive in a similar way, but it would be hard to include in an overall 
definition. We seem to be left at present with the looser idea that whenever 
oscillations in space are coupled with oscillations in time through a 
dispersion relation, we expect the typical effects of dispersive waves. 

The situation is similar for nonlinear systems: a restricted class can be 
identified and then the ideas are extrapolated in a natural way. 

A more comprehensive answer lies perhaps in the variational formula- 
tions to be developed in the later chapters. These allow the theory of the 
solutions to be developed in a general way, and presumably they provide 
the appropriate general framework for many questions, including the 
classification. But this is still an open question. 


11.2 General Solution by Fourier Integrals 


If (11.1)-(11.2) is an elementary solution for a linear equation, then, 
formally at least, 


p(x.)= [Flee 'de (11.15) 
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is also a solution. The arbitrary function F(«) may be chosen to fit 
arbitrary initial or boundary data, provided the data are reasonable 
enough to admit Fourier transforms. If there are n modes with n different 
choices of W(k«), there will be n terms like (11.15) with n arbitrary 
functions F(«). It will then be appropriate to give n initial conditions to 
determine the solution. The examples (11.6)-(11.8) all have two modes and 
it is appropriate to prescribe m and g, at t=0. As happens in these cases, 
the two modes will often be w= + W(x) and, in a typical: one dimensional 
problem, we have then 


00 ° - oo ° . 
φ- [ Fy(«)e™— Orde + f F,(x)e** tM, (11.16) 
— οὦ vas 
with initial conditions 


P=GP(x), P= ,(x) 


at ¢=0. If W(x) is odd in x, as in (11.7), the first term in (11.16) represents 
waves moving to the right and the second term represents waves moving to 


the left. If W(x) is even, as in (11.6) and (11.8), waves moving to the right 
and left appear in both contributions. Applying the initial conditions, we 
require 


wo(x)= [ (δι) + Fa() } eal 


00 - 
φι(χ)- -ἶἰ W(«) { F,(«) — F(x) }e™ dk. 
The inverse formulas give 


Fi() + F(x) = O(n) τς [ἡ φοί(χ)ε "ἄχ, 


ταν [Ρι() Fy) )} =O) τς fixe de, 


and F,(k), F,(«), are determined to be 


iP, (κ 
δ - 3 φῶς Ἔκ} 

ἰΦ (κ 
ΟΣ ἘΣ | 
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Since o(x),p,(x) are real, ®(—«)= @4(x) and Φ, (-- κ)ἘΞ Φ΄'(κ) where 
asterisks denote complex conjugates. It follows that for W(x) odd 
F,(—«) = F¥(«), 
(11.17) 
F,(—«) = FR(«); 
and for W(x) even 
F,(—«) = F3(«), 


(11.18) 
F,(—«) = FF(«). 


In either case, the solution (11.16) is real: real initial conditions must lead 
to real solutions for real equations. 

A standard solution from which others can be reconstructed is 
obtained by taking 


Po(x)=S(x), φι(χ) =9. 


Then F,(k)= F,(k)= 1/47 and (11.16) reduces to 
1 co 
φΞ -( οοβκχ cos W(x) tdk; (11.19) 
7 YO 


of course this is a formal integral to be interpreted as a generalized 
function. 


11.3 Asymptotic Behavior 


Although the Fourier integrals give exact solutions, the content is hard 
to see. It becomes clearer and one starts to understand the main features of 
dispersive waves by considering the asymptotic behavior for large x and ¢. 
We consider first the typical integral 


(x,t) =f F(«)e™ SSK 
2 
- οὐ 


in the one dimensional case. For wave motions we are interested in the 
behavior for both large x and 1; the interesting limit is 00 with x /t held 
fixed. (A particular choice of x/t allows us to examine waves moving with 
that velocity.) Accordingly, the integral is written 


es [Fen ἐς, (11.20) 
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where 
χ(κ)- WK) — «= 


For the present, x /t is a fixed parameter and only the dependence on x is 
displayed in x. The integral in (11.20) may then be studied by the method 
of stationary phase; this is, in fact, the problem for which Kelvin de- 
veloped the method. Kelvin argued that for large ¢, the main contribution 
to the integral is from the neighborhood of stationary points k=.k such 
that 


x(k) = W'(k) ~~ =0. (11.21) 
Otherwise, the contributions oscillate rapidly and make little net contribu- 
tion. The later development of the method of steepest descents (or saddle 
point method) is easier to justify and to assess errors. A full discussion of 
the methods is given, for example, in Jeffreys and Jeffreys (1956, Sections 
17.04-17.05). It will be sufficient here to derive the first term in the 
asymptotic expansion following Kelvin’s argument. 


The functions F(«), x(«) in (11.20) are expanded in Taylor series in the 
neighborhood of «=k. The dominant contribution comes from the terms 


F(«)=F(k), 
2 "“ 
X(«)=x(k) + (κ-- Κὐχ' (Χ), 
provided x”(k)#0. With these approximations, the contribution is 


F(k)exp { ~ix(k ye} [΄ exp | 7 5 (ek) x(k) t} de, 


The remaining integral is reduced to the real error integral 


ΕΣ : 1/2 
f edz = (=) 
a 


πο 


by rotating the path of integration* through +7 /4; the sign should be 
chosen to be the same as the sign of x”(k). Then we have 


2π , [π pal 
F(k) ν ἡχ" exp — ix(k)t— 4 S8nx 


“This corresponds to changing to the path of steepest descents. 
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If there is more than one stationary point k =k satisfying (11.21), each one 
contributes a similar term, and we have, 


> 71 20 [π 
ae Ε(Κ ee kx - -- -- Ww” 
φ ΜΝ ( ) ἡ" (| exp Xx iW(k)t , sgn (k)}. (11.22) 


points k 


The next term in the asymptotic behavior requires the Taylor series 
continued as far as the (x —k)? term in F(«) and the («—k)* term in x(k). 
Two further terms are required because the odd powers integrate out 
eventually. When this is done, preferably using the method of steepest 
descents, the additional term may be written as a factor 


(= PW ας WE NW 


l+— 2 NN gi De In 
t|w”| 2} 2 w” F = 24 μγ΄: 8 | (11.23) 


multiplying the term in (11.22). The complicated form comes from the 
necessity of working to two further terms in the Taylor series for F and x. 
In general, this series continues in inverse powers of ¢ with coefficients 
which are functions of k. 

Earlier, the precise meaning of “large ¢” was left vague. The require- 
ment may now be taken to be that the correction term in (11.23) be small; 
t must be large compared with time scales derived from the dispersion 
relation and from the length scale in the initial conditions. For sharply 
peaked initial conditions with small length scale, Ε΄ and Ε΄ are small and 
the requirement is that ¢ be large relative to the typical period in W(k), 
which in turn is given by parameters in the equation. For the extreme case 
of a ὃ function initial condition, F is constant and F’= F” =0. 

For the special case of two modes with w= + W(x), the full solution 15 
given by (11.16). We make the further assumption that W(x) is monotonic 
and positive for κ᾿ (this is usually the case), and we consider the 
asymptotic behavior of (11.16) for x >0. If W(x) is odd, W’(«) is even and 
(11.21) has two roots +k. These two contributions in (11.22) can be 
combined, since F,(— k)= F¥(k) from (11.17), and we have 


o~29( F160 \ prs exp| ikx —iW(k)t- i7sen ws)}} 


100, >0, (11.24) 
where k(x,¢) is the positive root of (11.21) defined by 
k(x,t):  W(k= = k>0, " >0. (11.25) 
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For odd W(k), the second integral in (11.16) does not contribute to the 
solution in x >0; it gives a corresponding expression for x <0. 

When W(x) is even, W’(«) is odd; (11.21) has one root k for x >0 and 
it is positive. There is only one contribution from the first integral in 
(11.16). However, for the second integral in (11.16), the stationary points 
satisfy 


y x 
Wi(K)=— =, 


and —k is a solution of this for x >0. That is, with k defined as in (11.25), 
there is a stationary point x=k in the first integral of (11.16) and a 
stationary point x= — k in the second one. In view of (1 1.18), the contribu- 
tions can again be combined and the net result is the same formula as 
(11.24). 

The significance of the condition W”(k)40 in the definition of dis- 
persive waves for linear systems is now clear. If W'"(k) is a constant, there 
are no stationary points for general x /t and the whole asymptotic analysis 
is different. Of course it is not necessary, since the Fourier integrals can be 
simplified immediately. The significance of W"(k)#0 appears also in the 
denominator of (11.24) and in the error term (11.23). If W(x) is not 
identically zero, but vanishes for some particular stationary point k, the 
correct asymptotic behavior is found by going to further terms in the 
Taylor series for x. If x”(k)=0, but x(k) #0, the contribution to (11.20) 
1s 


F(k)exp{ — ix(k)t} [ exp - ξιχ"()α - ky} di 


F(k 
τω i73 XP ikx — iW(k)t}. (11.26) 


-}}35),623. 
(¢|W'(k))) 


Since k is a function of x/t this would indicate a singular behavior on the 
corresponding line x ἐξ W'(k) and in its neighborhood. 
We now take up a detailed discussion of (11.24) and (11.25). 


11.4 Group Velocity; Wave Number and Amplitude Propagation 


At any point (x,7), (11.25) determines a certain wave number k(x, 0), 
and the dispersion relation w= W(k) gives also a frequency w(x,?) at that 
point. We may introduce a phase 


O( x, t) = xk(x, t) — tox, 1), 
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and (11.24) may then be written 


p= R{A(x, eM}, (11.27) 


where the complex amplitude is 


2 — (mi /4)sgn W” 
A(x,t)=2F,(k) ΠΤ κυ ων (11.28) 


The expression (11.27) is in the form of the elementary solution, but A,k,w 
are no longer constants. However, the solution still represents an oscil- 
latory wavetrain, with a phase @ describing the variations between local 
maxima and minima. The difference is that the wavetrain is not uniform; 
the distance and time between successive maxima are not constant, nor 15 
the amplitude. 

It is natural to generalize the concept of wave number and frequency 
in this nonuniform case by defining them as 6, and —6,, respectively. 
Counting the number of maxima in unit distance would obviously be a 
clumsy and ill-defined quantity, whereas 6, is straightforward and does 
correspond to the intuitive idea of a local wave number. Moreover, in the 
present case, we have 


O(x,t)=kx—Wk)t, 


00 dk 
Us — W' — 1.29 
ἜΝ k+{x-W'(k)t} ax (11.29) 
00 <p Ok | 

ae W(k)+{x W'(k)t} ὟΝ 

the stationary condition (11.25) eliminates the terms in k,,k,, and we have 
just 


56 =k(x,1), (11.30) 
06 
-- Wk) = — 02). (11.31) 


Thus the wave number k, which was first introduced as a particular value 
of the wave number in the Fourier integral, agrees with our extended 
definition of a local wave number 9, in an oscillatory nonuniform wave- 
train. The same is true of the corresponding frequency. Furthermore, the 
local wave number and local frequency satisfy the dispersion relation even in 
the nonuniform wavetrain. 
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These extensions work so neatly because the nonuniformity is not too 
great. If an oscillation is too irregular, we might find a phase function 6 
and then define 9, as the wave number, but the intuitive interpretation 
would be lost if 0, itself varied rapidly in the course of one oscillation. In 
our case, k(x,/) is a slowly varying function. From (11.25), 


Κι W' 1] Κι 1 Ι 


k kW" x? ko RW"(k) τ 


and x, ¢ are both relatively large. Therefore the relative change in one 
wavelength or one period is small. Thus k is a slowly varying function in 
this sense; the same is true of w. [Again we note the singular behavior near 
any points with W”(k)=0.] From (11.28), it is easily seen that A is also 
slowly varying. 

With these interpretations of the quantities appearing in (11.27), we 
return to the determination of Κα and w as functions of (x,7) from (11.25), 
and the determination of A from (11.28). Equation 11.25 determines k as a 
function of x and ¢, but to appreciate its content we turn it around and ask 
where a particular value ky will be found. The answer is: at points 


x=W'(ko)t. 


That is, an observer moving with the velocity W’(k,) will always see waves 
with wave number ky and frequency W(k,). The quantity 


is the group velocity; it is the important velocity for a “group” of waves 
with a distribution of wave number. The interpretation of (11.25) shows 
that different wave numbers propagate with the group velocity; any particular 
wave number Κρ is found displaced a distance W’(k,)t in time f. 

Any particular value 0, of the phase propagates according to 


4( x,t) =p. 
Hence it moves according to 
dx 
—+64=0 
ie dt ὑπο 
that is, 
dx 0, 
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Thus the phase velocity c is still given by w/k even though the meanings of 
ω and k have been extended. But it is different from the group velocity. An 
observer following any particular crest moves with the local phase velocity 
but sees the local wave number and frequency changing; that is, neighbor- 
ing crests get farther away. An observer moving with the group velocity 
sees the same local wave number and frequency, but crests keep passing 
him. 

As illustration of this important distinction, consider the beam equa- 
tion. The dispersion relation is 


W(k) = yk’. 
Hence (11.25) becomes 


W'(k) =2yk= =, 


and we have 


x x? ~ 


k= — ἘΣ ΠῈΣ -- 


Ayt?’ 4yt 


The group lines of constant k and w are 
ΞΠΕΤΞ constant; 
2yt 

the phase lines of constant @ are 


2 
Δ =constant. 
A4yt 


These are shown in the (x; ἢ) diagram in Fig. 11.1. In this case the group 
velocity 2yk is greater than the phase velocity yk. 


We 
LA 
k=@ Χ 


Fig. 11.1. Group lines (solid lines) and phase lines (dashed lines) for waves in a beam. 
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k=O x 


Fig. 11.2. Group lines (solid lines) and phase lines (dashed lines) for deep water waves. 


For water waves in deep water (see Chapter 12), the dispersion 
relation is W= V gk . Hence W'(k)=x/t leads to 


is _ gt gt 
Ae τι άχ᾽ 
The group velocity + Vg/k is less than the phase velocity Vg/k and we 
have the situation shown in Fig. 11.2. 

In all cases (for the uniform media considered so far) the group lines 
are straight lines whereas the phase lines are not; each wave number 
propagates with a constant velocity; each phase accelerates or decelerates 
as it passes through different wave numbers. 

If we take 6 function initial conditions so that F,(k)=1/47, the 
amplitudes A(x,7) can be given explicitly as well and the complete asymp- 
totic solutions are 


] x? 7 
p~— 00s ὅς - τ) (11.32) 
ν΄ 4πγι ἄγ «Ἢ 
for the beam, and 
1/2 2 
1/8 t StU og 


for water waves. [In fact, (11.32) is exact since W(k) is exactly quadratic 
for the beam. ] 

A second important role of the group velocity appears in studying the 
distribution of A(x,f). The form of (11.28) suggests that |A|? is the 
interesting quantity to consider, and this is also natural on physical 
grounds since it is an energylike quantity. The relation of |A|’ to the true 


Sec 11.4 GROUP VELOCITY 379 


energy density and to the so-called “wave action” will be pursued later. 
For the present, 4(χ,0) is a well-defined quantity given by (1 1.28) and we 
may consider |A|’. 

The integral of |A|? between any points x,>x,>0 is given from 
(11.28) by 


Q(t) = [4456 


ἊΣ E(k) F*(k) 
πῆι] ———~~dx. 
ως Ww" (k)| 


In this integral k is given by (11.25). Since k appears in the arguments of 
the integrand and x does not, it is natural to introduce k as a new variable 
of integration through the transformation 


x=W'(k)t. 
For W”(k)>0, we have 


ky 
Q(t) =8 f F,(k) F*(k)dk, (11.34) 


where k, and k, are defined by 
x,=W'(k,)t, = x2 = Μ΄ (Κ4)}. (11.35) 


If W’(k)<0, the order of the limits is reversed. 

Now, if k, and k, are held fixed as ¢ varies, Q(t) remains constant. 
According to (11.35), the points x, and x, are then moving with the 
corresponding group velocities. We have therefore proved that the total 
amount of |A|* between any pair of group lines remains the same. In this 
sense, |A|? propagates with the group velocity. The group lines diverge with a 
separation increasing like t; hence [4] decreases like τ '/”. 

In the special case of a wave packet where the initial disturbance is 
localized in space and contains appreciable amplitude only in wave num- 
bers close to some particular value k*, say, the resulting disturbance is 
confined to the neighborhood of the particular group line k* and the wave 
packet as a whole moves with the particular group velocity W’(k*). 
Accounts of group velocity in the literature are frequenctly limited to this 
case. The foregoing arguments are more general, however, and allow a 
general distribution over all wave numbers with the full dispersion shown 
in Figs. 11.1 and 11.2 as the entire range of values of W’(k) come into 


play. 
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11.5 Kinematic Derivation of Group Velocity 


The concept of group velocity is so fundamental in understanding the 
wave motion that one feels it should not be only the end product of 
Fourier analysis and stationary phase. In nonuniform media, or for non- 
linear problems, where the Fourier analysis cannot be carried out in this 
way, Surely the same concepts must appear and be equally important. How 
do we free the ideas from the Fourier analysis? 

To see how to generalize the results, we look at their derivation by 
more intuitive arguments. The arguments can always be checked against 
the previous discussion, or justified eventually by direct asymptotic 
methods. The advantages are tremendous, since we can then make pro- 
gress on approximate treatments of problems for which the exact solutions 
are not known. At the same time, we obtain quicker and fuller insight even 
in those problems where the exact solution can be found. : 

We first consider the role of group velocity in determining the propa- 
gation of wave number and frequency. On reexamination of the arguments 
we see that very little was required. First, if we assume there is a slowly 
varying wavetrain and that a phase function 0(x,/) exists, we can define 
local wave number and frequency by 


k=0,, w=—§,. (11.36) 
If, further, we know or can propose a dispersion relation 
w= W(k), (11.37) 


we have an equation for 9 and we could proceed to solve it to determine 
the geometry of the wave pattern. It is usually more convenient, however, 
to eliminate θ in (11.36) to give 

d dw 

-- 4+ — = . 

5; ἢ ἢ. 0, (11.38) 
and use this in conjuction with (11.37) to solve first for k(x,t) and w(x, ὃ). 
Notice that this formulation is the basic one for the nonlinear waves 
discussed in Chapter 2. Indeed, k is the density of waves, w is the flux of 
waves, and (11.38) is a statement of the conservation of waves! Substitut- 
ing the dispersion relation (11.37) in (11.38), we have 

0k 


dk 
a tla =0,  C(k) = w(K). (11.39) 


The group velocity C(k) is the propagation velocity for the wave number k. 
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According to the analysis in Chapter 2, the general solution of (11.39) 
for an initial distribution k= f(x) at t=0 is 


k=f(&), x=E+ C(E)z, (11.40) 


where 


C(EV=C { f(E) }- 


The special case of a centered simple wave arises when the range of values 
of k is initially concentrated at the origin. Then k(x, τ) is determined from 


x=C(k)t. 


This is just the determination of k given in (11.25) and represented in Figs. 
11.1 and 11.2. The validity of the asymptotic expansion (11.24), noted from 
(11.23), requires x and ¢ so large that the initial disturbance appears to be 
concentrated at the origin. 

But, the concepts are already extended. The slowly varying wavetrain 
defined by 6(x,f) need not have originated from a relatively concentrated 
disturbance at the origin, and the distribution of k(x,t) may be more 
general, as in (11.40). Moreover, there is no necessity for the solution φ to 
be sinusoidal in @; any oscillatory wavetrain with a definable @ and a 
dispersion relation between k and ὦ is included. 

It is interesting and significant that (11.39) for & is nonlinear, even 
though the original problem is linear, and that it is hyperbolic, even though 
the original equation for φ is not in general. This is the first instance of 
hyperbolic equations arising for the propagation of important overall 
quantities like k. In this sense one can preserve the association of wave 
propagation with hyperbolic equations, but there is a considerable non- 
hyperbolic substructure. 


Extensions. 


The simplified derivation of the group velocity is readily extended 
further to linear problems in more dimensions and in nonuniform media. 
The extension to nonlinear problems must await further developments, 
because the dispersion relation then involves also the amplitude. For 
multidimensional equations with constant coefficients, the exact solution 
can still be obtained from the use of multiple Fourier integrals and the 
asymptotic expansion can then be obtained from stationary phase. For n 
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space dimensions, it is easy to show that 


oo - « 
p= [ Ε(κ)οῖ κα τ Wen ae 
= 


τεῳ 2)" {a “0 Π | expen amas) (11.41) 
t 0k, 0k, 
where 

Xi 0W(k) 

t 0k, ᾿ 


and ¢ depends on the number of factors wi /4 arising from path rotations. 
However, let us use the simpler kinematic derivation and at the same time 
include nonuniform media. 

The description of a slowly varying wave in, say, three dimensions 
involves a phase (x,t) where x=(x,,X,,x;). We define the frequency ὦ 
and vector wave number k by 


ὄξει . Ae. (11.42) 


We assume that a dispersion relation is known, and that it may be written 
w= W(k,x, ἡ. 


For a uniform medium, this would be obtained from the elementary 
solution (11.1). For a slightly nonuniform medium, it would appear reason- 
able to find the dispersion relation first for constant values of the parame- 
ters of the medium and then reinsert their dependence on x,t. For 
example, if a,8,y were slowly varying functions of x or t in the problems 
(11.6)-(11.9), we would use the same dispersion relations displayed there 
but with a,8,y, taken to be the specified functions of x and ¢. Intuitively, 
this would appear to be a satisfactory procedure provided that a, 8,y, vary 
little in a typical wavelength and period. This will be substantiated in 
Sections 11.7 and 11.8. 
If we eliminate θ from (11.42), we have 


ὟΣ dk ok, 
Poe ep ey (11.43) 
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Then, if w= W(k,x, 2) is introduced into the first of these, 


Ἵ 
where 
0W(k, x,t) 
Ok, ; 


J 


C,(k, x,t) = (11.45) 


The three dimensional group velocity C is defined by (11.45) and is the 
propagation velocity in (11.44) for the determination of Κ,. Equation 11.44 
may be written in characteristic form as 


Pita οὐ oy ee (11.46) 


We note that k is constant on each characteristic when the medium is 
uniform in x, and then the characteristics are straight lines in the (x, 1?) 
space. Each value of k propagates with the corresponding constant group 
velocity C(k). But this is not true in a nonhomogeneous medium, for then 
the values of k vary as they propagate along the characteristics and the 
characteristics are no longer straight. It might also be remarked that 


dw dw , dw _ OW. 


ee an eel 
— 


dt δὶ ὃχ, at’ 


the frequency is constant on each characteristic when the medium is time 
independent but not otherwise. 

It is interesting that the equations of (11.46) are identical with Hamil- 
ton’s equations in mechanics if x and k are interpreted as coordinates and 
momenta and W(k,x,/) is taken to be the Hamiltonian! If instead of 
eliminating 8, we substitute for w and k in the dispersion relation, we have 

00 00 


8 w( Ext} =0. (11.47) 


This is the Hamilton-Jacobi equation, with the phase @ as the action. 
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If W is independent of x and f¢, the solution of (11.46) for an initial 
distribution k; = f,(x) is | 


k,=f(€), x, =& + C,(§)i, (11.48) 


where 
C(§)=¢, { £(€) }. 


Again the centered solution corresponding to a whole range of k released 
at the origin at +=0 is found by determining k(x, 1) from 


x,=C(k)t. (11.49) 


This is the special case obtained in the asymptotic expansion (11.41) of the 
multiple Fourier integral. 

Examples of the use of these various equations will be given in 
Chapter 12. 


11.6 Energy Propagation 


The preceding kinematic derivation shows one role of the group 
velocity and determines the geometry of the waves. The second role of the 
group velocity is in connection with the amplitude distribution A(x,?) in 
(11.27)-(11.28). We would like to have direct access to the behavior of A 
and its involvement with the group velocity in much the same spirit. It 
looks feasible since energy is apparently involved and we expect to be able 
to make a direct statement of energy balance. This is the case. However, 
recent work using variational formulations has not only improved and 
generalized the derivations, it has also shown that “wave action” rather 
than energy is perhaps the more fundamental concept in this connection. 
The variational approach is subtle and it is useful to prepare the ground 
with a more traditional discussion of energy propagation. 

We start as before with the one dimensional problem for a uniform 
medium and see how to obtain information on the amplitude distribution 
without using the Fourier integral solution. In this first approach we are 
forced to work with specific cases. The Klein-Gordon equation 


2 
φ,-αφ,, + B ~=0 
is one of the simplest to use, since it keeps the order of derivatives as low 


as possible. It is hyperbolic and is exceptional in that respect, but we are 
concerned only with oscillatory parts of the solution, not with wavefront 
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behavior. The accompanying energy equation is easily obtained, and for 
constant coefficients a, 8, it is 


d/l l ] 0 
ἀι(Σ φῦ χαῖρε 78") τς (--αἶρφ,)πο, (1150) 


We now consider ἃ slowly varying wavetrain in which 
φ-- R(Ae”)=acos(8+ ἡ), 
a=|A|, n=argA, 


and we compute the energy density and energy flux. A term like 1g? will 
have 


5 oa sin® (θ- η) 


together with terms involving a, and n,. Because a and ἢ are assumed to be 
slowly varying these latter terms are neglected in the first approximation. 
Treating the other terms similarly we see that the energy density is given 
approximately by 


ὁ (w? + a°k?)asin? (0+) + + B’a? cos* (θη) (11.51) 


and the energy flux by 
a’wka’ sin? (8+ ἡ). (11.52) 


In cases where higher derivatives occur, extra terms in the derivatives of w 
and k would also arise but be neglected because w and k are also slowly 
varying quantities. 

Since we are concerned with variations of the overall quantities w, k,a, 
and not with the details of the oscillations, we consider the average values 
of (11.51) and (11.52). The average values of cos*(@+7) and sin*(@+) 
over one period are both equal to one half, so we have 


ὃ = ων a’k? + B*)a?, (11.53) 
OF = 5 aka’, (11.54) 


for the average values of the energy density and the energy flux. In this 
particular problem, the dispersion relation is 


w=YVark*+ B? ; (11.55) 
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hence 
& = 5 (7k? + Ba, F= xa 2kVla?k? +B? a2. (1.56) 


The group velocity is 


Cha ὦ (11.57) 


and we observe the important result that 


=C(k)&. (11.58) 
This turns out to be general. 
It is now tempting, on the intuitive grounds that energy must be 
balanced overall, to propose the “averaged” energy equation 


06 


ἜΡΓ +2(C&)= 0 (11.59) 


as the equation to determine a. This is the differential form of the 
Statement that the total energy between any two group lines remains constant. 
For if we consider the energy 


E(t)= f eure (11.60) 


between points x,,x,, moving with the group velocities C(Kk,), C(k), 
respectively, we have 


dE 29 ὃ 
“1. “Ξάχεῦς,ὃ,-- ς δ, (11.61) 


and from (11.59) this is zero. Conversely, (11.59) is just the limit of (11.61) 
as X,— x,->0. 

This behavior was found in Section 11.4 for a? itself rather than for 
& . However, & =f(k)a?, and when this is substituted in (11.59) the 
resulting equation can be expanded to 


iy, ἧς = + οὐ) + Fhe? { E+ Gt 0. (62 
Since 


a7 + Gay =0 (11.63) 
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from (11.39), we have 
2 
ba" + ὃ (ca?\=0. (11.64) 


We see that any function of k can be slipped in or out of equations (11.59) 
and (11.64), provided (11.63) holds. Now, by the same argument given for 
& , (11.64) is the differential form of the result found in Section 11.4 that 


Q(t) = [ax (11.65) 


remains constant between group lines. Hence (11.64) and (11.59) are 
confirmed. Direct justification will appear later. 

We might also note that the characteristic forms of (11.63) and (11.64) 
are 


dx 


2 
ΣΕ τως hi Ξ τ 2 
0, Ο΄ (ΚῚΚ, α΄, τ 


=C(k). (11.66) 
[In the second equation k, can be treated as a known quantity because 
k(x,t) would be determined first; this is the exceptional case of Example 7 
in Section 5.2.] The group velocity C(k) appears as a double characteristic 
velocity, corresponding to the dual role noted in Section 11.4. 

The asymptotic solution obtained in Section 11.3 is the special case of 
a centered wave, where k(x,?) is the function of x/t determined from 


In this case the amplitude equation is 


Since k itself may be used as a characteristic variable, the solution may be 
written 


a=t~'/*@(k) 


where @ (kK) is an arbitrary function. This agrees with (11.28) and again 
confirms the validity of the approach. Of course the function @ (k) cannot 
be determined without some tie to the initial conditions, and this cannot be 
found from asymptotic discussion alone. 

In this initial value problem, we know in fact that @ (k) is given by 
(11.28), and it is interesting to note that the energy E(t) between the group 
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lines k=k, and k=k, as given by (11.60) is therefore 


“2 FAK) FHk 
E(t)=8 f Kk) 


Κλ Η | 
=8n [ SAF (FER) dk, (11.67) 


where f(k) is the factor 4(a’k*+ B*) appearing in (11.56). From (11.50), 
the exact total energy is 


Be (lee eee hea 
tot 5 Pe 2 x 2 φ x 


and from the exact solution (11.16), together with relation (11.18), this may 
be put in the form 


E,,.=87 fo fle) F,(«) F#() de. (11.68) 


This applies at all times both before and after the dispersion into a 
wavetrain; it shows the distribution of energy over the wave number range. 
But after dispersion the wave number range is spread out explicitly as a 
distribution over x. The form of E(f) in (11.67) shows that the same 
amount of energy is still associated with the range k,<<« «Κις. That is, the 
energy put into any wave number range remains there. 

The energy arguments leading to (11.58) and (11.59) are easily ex- 
tended to more dimensions. For the Klein-Gordon example, the energy 
equation becomes 


d/1 5,1 3 
ἐπι [ τῷ τα φ, + Σβ᾽}) Ἐπς {-- αἴῳ,φ.)Ξ0, 
J 


and, for a slowly varying wavetrain p~acos(@+ ἡ), its averaged form is 


ot Ox, ; 


where 


& = (δε ark? τ B°)a’, G = 5 0k a? 
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It may be verified from the dispersion relation that 
$= CoS, (11.69) 
and then the averaged energy equation becomes 


Le + a G S)=0. (11.70) 


The total energy in any volume moving with the group lines remains 


constant. For 
a & dV= ee 0 ἐδ Ὁ" & C,n, aS, 
dt Jv) V(t) 


where S(t) is the surface of V(t), n, is the outward normal to S(t), and Cn, 
is its normal velocity. From the divergence theorem, (11.70) shows this to 
be zero. The characteristic form of (11.70) is 


dC, dx. 
d& Jj Xj = C,(k); 


ee ---...- oO 


dt ax, "at 


the energy density decays due to the divergence dC,/0x, of the group lines. 
For the uniform medium, k remains constant on the group lines [see 
(11.46)]. Therefore, since & = f(k)a’, a” satisfies the same equations. This 
may also be verified directly from (11.70) with appropriate extension of 
(11.62). For the centered wave corresponding to (11.41), k is determined 
from 


X; 
— =C,(k); 
t 

hence 
da? μα 
dt ἤ 


where n is the number of dimensions. This agrees with the amplitude 
variation in (11.41). 

We see that the averaged energy equation does indeed give a correct 
description of the amplitude distribution in accordance with the behavior 
found earlier. It is satisfactory in that it provides an approach that avoids 
the Fourier transforms and so offers the hope of generalization, but in the 
present form it is not completely satisfactory in that the seemingly general 
results (11.69) and (11.70) appear only at the end of manipulations using 
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the specific equation. If we repeat the same type of argument on the other 
linear examples in (11.7)-(11.9), exactly the same final results (11.69) and 
(11.70) are found. 

For example, the energy equation for (11.7) has an energy density 


1 5, 122,11 5)22 
ae 22 + a 
5 Ὁ TOPs, 58 Px,t 
and energy flux vector 
= ap, Px -β *, Prux,: 


The average values obtained by (1) substituting p~acos(@+ 7), (2) ne- 
glecting derivatives of a, ἡ, k,, and w, and (3) replacing cos?(@+7) and 
sin?(@ +) by their average values of one-half, are 


] 
Ὁ = πίω" Ὁ ak? + B2w°*k?)a, 


it is verified that 
F = 
5 = Cb, 
and the average energy equation can again be written as 


0 & 0 

Pine a a 

The same results are found for the remaining examples in (11.8) and (11.9). 
It seems clear that these important general results should be estab- 

lished once and for all by general arguments without pursuing the detailed 

derivation each time. Such arguments (and much more) are provided by 

the variational approach. 


11.7 The Variational Approach 


This approach was originally developed for the much more difficult 
case of nonlinear wavetrains and it has many ramifications. A full account 
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must await the further development of these topics, but we can take it far 
enough to complete the above discussion. 
We consider first the variational principle 


bJ=5 f f L(v,.9,.p)dtdx=0 (11.71) 
R 


for a function φίχ,). The variational principle means that the integral J[q] 
over any finite region R should be stationary to small changes of in the 
following sense. Consider two neighboring functions φίχ,ἢ) and (x,/)+ 
h(x,t) where h is “small”; since first derivatives appear in (11.71), both 
functions are taken to be continuously differentiable. The smallness of ἢ is 
measured in this context by the “norm”: 


|| 2|| =max|h| + max|A,| + max|A, |. 


The function L is usually some rather simple function and certainly we can 
suppose that it has bounded continuous second derivatives. Then by 
Taylor’s expansion 


J[pth]—Jlol= ff {Lat Ly; + Leh} dtdx+ O(|Al?), (11.72) 
R 


where @ , denotes dp/0x,. The expression linear in h is the first variation 
δ [φ, h]. The variational principle (11.71) requires that 6/[y,A]=0 for all 
admissible h. By integration by parts (the divergence theorem) we have 


a 
SI L@,h] = ff{-3 < Ly ἀῶ τις | had (11.73) 


if we choose, in particular, functions ἢ that vanish on the boundary of R. 
We now require (11.73) to vanish for all such h. This implies 
0 0 


ΕΝ ΤῊΣ π δ (11.74) 


by the usual continuity argument. [If (11.74) were nonzero, say positive, at 
any point, then there would be a small neighborhood in which it remained 
positive; a choice of h positive in this neighborhood and zero elsewhere 
would violate the requirement that (11.73) vanish.] 

The argument extends in natural fashion if L includes second or 
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higher derivatives in m. The corresponding variational equation 15 


Lo Or ewe Ox, Ly. ἘΞ Ξ Σ at 2 Ly Ἔ ardx, | + ax, ax, φ, jk ae - 0, 
(11.75) 


which is easily recognized as the end result of the repeated integration by 
parts. Equations 11.74 and 11.75 are partial differential equations for 
(x,t), and equations in this form can be given the equivalent variational 
formulation. A variational principle involving a number of functions 
g(x, t) would lead to (11.75) for each p(x, ) (since they could be varied 
independently), and hence to a system of equations. The question of 
finding a variational principle for a given system of equations can be a 
difficult one, but it is usually straightforward when only a single equation 
is involved. We note that Lagrangians for the examples (11.6)-(11.8) are 


_1 2 1.22, 1 p22 
L= 79; 50s, + ZB Pix, (11.76) 


ey ey 1 5 
L= 2 φ, 2 Y Pxx 
respectively, and (11.9) is included by substituting p=y, and taking 
ἘΦ 1 2: ΤΑΣ 
τιν 2 Ww, + 2 αψν 5 Puy. 
To study slowly varying wavetrains in which 


p~acos(9+7), (11.77) 
we now calculate the Lagrangian L in exactly the same way as the energy 
density and flux were calculated in the last section. That is, (11.77) is 
substituted, derivatives of a, ἡ, ὦ, k are all neglected as being small, and 
the result is averaged over one period. The result in each case is a function 
° (w,k,a); in particular, for the examples in (11.76) we have 


ΕΠ 
ΡΞ ΤΩ’ - a7k? + B?w*k*)a’, (11.78) 


ΒΞ (w?— 7k*)a?, 
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We now propose the “average variational principle’, 


5 { { £(-4,,0,,a) dtdx=0, (11.79) 


for the functions α(χ, ἢ), 0(x,t). This is similar to the proposal in (11.59), 
but it certainly is a much more subtle affair which will have to be 
examined in detail later. However, accepting it for the present, we shall see 
immediately that it provides a general and extremely powerful approach. 

Since derivatives of a do not occur, the variational equation (11.75) 
for variations in a is merely 


δα: ef =0. 
The variational equation for @ is 


ὃ ϑ _ 


60: 
In these expressions the dependence on @ involves only its derivatives. 
Accordingly, once the variational equations have been obtained, it is usually 
convenient to work again with w,k,a, and take the set of equations 


β, =0, (11.80) 
0 0 = 
ry ΟΜ Fa δὰ (11.81) 
Ok; dw 0k; 0k; 
of on dx, dx, = #2) 


the latter being just the consistency equations for the existence of @. 
Equation 11.80 is a functional relation between 6,k,a, so it can be no 
other but the dispersion relation. We check from (11.78) that this is the 
case. Indeed in any linear problem, it is clear that L must be quadratic in » 
and its derivatives, and that as a consequence £ must always take the form 


& = G(w,k)a’. (11.83) 
Then from (11.80) the dispersion relation must be 
G(w,k) =0, (11.84) 


and the function G(w,k) in (5 is nothing but the dispersion function. We 
did not even need to calculate © for each case! 

This is all an unexpected bonus. The aim was to find a general 
argument for the amplitude equation, but we have actually included the 
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kinematic theory proposed in Section 11.5 for the geometry of the wave 
pattern. Equations 11.80 and 11.82 provide just that theory. 

We note that the stationary value of £ is in fact zero. In those simple 
cases in which L is the difference of kinetic and potential energies, this 
proves [subject to the eventual justification of (11.79)] that their average 
values are equal. This is the well-known equipartition of energy for linear 
problems. 

Turning next to the amplitude equation (11.81), we note that it may 
now be written 


ὃ 9 
5; (6.4) -- 5 - (G,a") =0. (11.85) 


J 


In principle (11.84) can be solved in the form w= W(k) so that 


G { W(k),k} =0 
is an identity. Therefore 
OW = 
σι ak, Ἔ Gy, 0 
and the group velocity 
G | 
SOW τον 
C= ak, τ (11.86) 


If we denote G,(W,k) by g(k), (11.85) may be written 
9 0 
a, ( 5(1)α5) Ἐ το { a(k) G(k)a*} =0. (11.87) 
J 


From (11.82) we have 


0k; 0k; 0k, dk, 
—+cC,— =), — -- — =0; 
Or J 0x; 0x; OX; 


by using these, the factor g(k) can be slipped out of (11.87) and we have 
the amplitude equation 


Thus the variational set (11.80)(11.82) does give precisely the set of 
equations discussed in the last two sections. 
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At first sight one might expect that (11.87) is the averaged energy 
equation (11.70). But a check on examples shows that the factor f(k) in & 
and the factor g(k) are not the same. However, there is a standard 
procedure for associating an energy equation with a variational principle. 
Noether’s theorem shows that there is a conservation equation correspond- 
ing to any group of transformations for which the variational principle is 
invariant (see Gelfand and Fomin, 1963, p. 177). If the principle is 
invariant to a translation in ¢, the corresponding equation is always the 
energy equation or a multiple of it. Since (11.79) is invariant to a transla- 
tion in ¢, this applies and the corresponding energy equation is found to be 


9 ὃ 
9, (WE £)+ pram as (11.88) 


Here, rather than pursue the detailed application of Noether’s theorem, it 
is sufficient to note that (11.88) follows from the system (11.80)-(11.82). 
This is the energy equation. One can easily verify that the previous 
examples agree. 


In the linear problems considered here, we found that the stationary 
value of © is zero. Hence the energy density & and flux % are given by 


δ =wf,, $= —wl,. (11.89) 
We see therefore that the quantity ©, is in fact 
᾿Ξ (11.90) 


and (11.81) or (11.87) may be written 


ΕΣ Ὁ ἾςΣ 
ΡΣ ἐξ} | =0. (11.91) 
From (11.83) and (11.89), we have 

δ =wG,,a’ 

3, = — Ga =C&, 


which gives the general proof of the relation between ¥ and &. 

But we have another bonus from the general approach. It draws prime 
attention to the quantity (11.90) and to (11.81) and (11.91). The quantity 
& /w is well known in ordinary mechanics as the adiabatic invariant for 
slow modulations of a linear vibrating system. We shall show later that £,, 
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is the appropriate quantity in the nonlinear case. Thus these concepts have 
been extended to the case of waves. Instead of an invariant we have the 
conservation equation (11.81) governed by a timelike adiabatic quantity £,, 
and spacelike quantities — Pe . This conservation equation has become 
known as the conservation of “wave action”. 

There is also a “wave momentum” equation which is the counterpart 
to (11.88) with the roles of x, and ¢ interchanged: 


ὃ ΓΝ ᾿ 
9p Ra) + a kf, + £6,)=0. (11.92) 


This is easily verified from the set (11.80)-(11.82). We note that the 
momentum density is 
k: 
k£,=—6&; (11.93) 
W 
it is a vector in the direction of k with magnitude & /c, where c is the 
phase speed. We again have the general proof of a familar result which is 
hard to establish by other means. 


Nonuniform Media. 


Another advantage of the variational approach is that there is no 
change in the basic equations (11.80)-(11.82) if the medium varies slowly 
with x and ¢. This would be the case, for example, if the parameters a, B,y 
in (11.76) were functions of (Χ, ἢ). If the change in one period 15 small, the 
average Lagrangian can be formed as before, neglecting the changes a, £, y 
in one period along with the contributions of the derivatives of w,k,a,7. 
Then (11.79) is proposed as before, the only difference being that ἡ now 
depends explicitly on x and ¢ as well as through the functions a(x,7/) and 
θ(χ, ἢ). However, the variational equations (11.80)-(11.82) are unchanged; 
one has only to be careful to include further derivatives in manipulating 
and expanding the equations. In particular, the energy equation now 
becomes 


3 ἢ 
aie B+ G(s) =~ δι, (11.94) 


as is easily verified. Similarly, the momentum equation (11.92) picks up a 
term &, on the right hand side. If the medium depends on /, energy is no 
longer conserved. If it depends on x, momentum is no longer conserved. 
But notice that wave action is conserved in all cases. This again shows the 
preferred position of (11.81) over the energy equation in modulation 
theory. 
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Nonlinear Wavetrains. 


Finally, the variational approach requires very little modification to 
study modulations on nonlinear wavetrains. The main questions will be the 
functional form to replace (11.77), the details of the averaging to find the 
function &, and, in general, the appearance of further overall functions 
similar to w,k,a in the complete description. In the simplest cases, how- 
ever, the latter do not arise and once £ (w,k,a) has been found, the set 
(11.80)-(11.82) still apply. The major difference is the crucially important 
one that © is no longer simply proportional to a”, and (11.80) and (11.82) 
do not uncouple from (11.81). These questions and the careful justification 
of the theory so far will be taken up in Chapter 14. 


11.8 The Direct Use of Asymptotic Expansions 


A more obvious way to avoid the Fourier integrals and open the way 
for extension to problems of nonuniform media and nonlinear systems is 
to substitute asymptotic series of the appropriate form directly into the 
equations of the problem. For the linear problems discussed so far, the 
required form of expansion 15 


lo 8) 
p~e®) S$’ A (x,0), (11.95) 
n=0 


where the A, are terms of successively smaller order in the relevant small 
parameter. In the present context, this form is suggested by the first term 
obtained in (11.27). It is also an extension of the geometrical optics series 
discussed in (7.62). For the earlier hyperbolic problems the relation be- 
tween 9, and 6, would be homogeneous, so that for fixed frequency w one 
may choose @(x,t)=wS(x,f) as was done in that discussion. Here the 
dispersion relation between 6, and @, is more general and we allow ἃ 
continuous distribution of frequency. 

The approach using (11.95) is satisfactory as far as it goes and it can 
be applied to problems of nonuniform media. But to a greater extent than 
was found in the discussion via the average energy equation in Section 
11.6, one works with the specific expressions for each particular problem 
only to find at the very end that the results are general. In the case of 
extensions to nonlinear problems, the correct form of expansion is not 
immediately clear, the manipulation of expressions may become hor- 
rendous, and again general results are hidden in the specific details. These 
weaknesses are remedied by applying expansions such as (11.95) directly in 
the variational formulation of the problem. This is, essentially, how the 
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variational approach is justified. But some ingenuity is involved and it is 
useful to include here as background some discussion of the direct applica- 
tion of (11.95) to the equations. It is sufficient to treat the one dimensional 
case. 

The expansion discussed in Section 11.3 is valid for t->0o, with x/t 
fixed. In that case θίχ,) and A,(x,?) take the form 


0(x,1)=10(~), Aaa 7B, =). (11.96) 


The expansion (11.95) is in increasing powers of t~' (or strictly speaking, 
in powers of +/t where 7 is a typical time scale introduced by the 
parameters in the equations and initial conditions). To keep the technique 
flexible and see the common features in the use of (11.95) in different 
circumstances, we do not introduce (11.96) explicitly, but we note rather 
that 

dA, 84 074, 


a ex OCA 5} 


=O(A,,5),°*:. (11.97) 


That is, each differentiation increases the order by one. Similarly, 6, and θ᾽, 
are O(1) quantities and any further differentiation increases their order by 
one each time. The increase of order on taking derivatives indicates that 6, 
6, and the A, are slowly varying functions. This is a general feature in using 
(11.95) whether the expansion is in t/t or some other quantity. 

As an illustrative example we take the one dimensional Klein-Gordon 
equation 


Pu τ αἶφι, ΕΝ Bp = 0. 


The expansion (11.95) is substituted and terms of successive orders are 
equated to zero. We have 


(θ02-- α292-- B)A,=0, 
(02-- αΓθ3-- B’)A,—- {2i0,Ao, —2ia70, Ao, + i(0,,— αἴθ... )Ao} = 0, 
(0; " a0? a B*)A, = {2i0,A It 2iaO,A 1x + i(9,, 7 a8, )A 1} = A on " aA gx: 


and so on. The first equation eliminates the corresponding term in the later 
equations. If, further, we introduce 


k=90,, w= --θ., 


ft? 
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the hierarchy becomes 
w’ — ak? — B?=0, (11.98) 
2wA 9, + 2a*kAo, +(w, + a7k,)Ag=0, (11.99) 


2wA ,, + 207kA,, +(w, + a7k,)A,= — (Aon — 7Ao,,), (11.100) 


and so on. 

The first equation is the dispersion relation between w and k, and if 
we prefer to work with these quantities rather than @ itself, the consistency 
relation 


k, +o, =0 (11.101) 


is added. This is exactly the determination of 8,w,k described in Section 
11.5. 
The equation for Ay may be written 


ὃ ( Sedodd) τ ἢ [Σαχάραξ) 0. (11.102) 


Since [40[2Ξ αὖ, and in this case 


e = 5 (w?— ak? — B?)a2, 


this is just the wave action equation (11.81). It 1s interesting that the wave 
action equation rather than the energy equation arises most obviously, 
although of course the energy equation can also be obtained from (1 1.99). 
Notice that this point might pass unrecognized without the Lagrangian 
theory. 

One is usually interested only in the first term of the expansion and 
therefore in the first two equations (11.98) and (11.99). However, once @ 
and A, are determined, A, is obtained by solving (11.100), A, from the next 
equation in the hierarchy, and so on. 

As a special case, it is easy to check that the equations have solutions 
of the form (11.96), and the expansion then agrees with the one obtained 
from the Fourier integrals in Section 11.3. The relevant solution of (11.98) 
and (11.101) is the function k(x/?t) determined from 


= =C(k). 
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Then the various forms of (11.102) all give 
Ag=t~ "7B, =). 


Since k is a function of x/t, this may also be written 


465 t~ 1/23 (k) 


in agreement with (11.28). Of course the function &,(x) is only determined 
by use of the initial conditions. In this particular case the expansion does 
not apply in the early stages and Fourier transforms or some equivalent 
bridge is unavoidable. When A, has been determined, (11.100) may be 
solved for A, and the result is (11.23). In fact it is much simpler to 
determine subsequent terms in the expansion by this direct method instead 
of carrying out stationary phase to high orders. 

The expansions are not limited to the centered wave solution; they 
apply to any wavetrain that is slowly varying in the sense of (11.97). For 
example, we might consider a wavetrain produced by a modulated source 
that provides slow changes in the frequency and amplitude. If x and ¢ are 
normalized variables obtained by dividing the original x and ὦ by a typical 
wavelength and typical period, respectively, the modulations provided at 
the source are functions of εἰ and the appropriate forms of θ and A, are 


0=«~ '9(ex,et), A, =€"A_(ex,et), (11.103) 


where ε is the ratio of the typical period to the time scale of the modula- 
tions. The variables are slowly varying in the sense of (11.97) with € as the 
relevant small parameter. For the Klein-Gordon example, the resulting 
equations are (11.98)-(11.100). These correspond to the successive terms of 
order 1,¢,€?, respectively, but there is no need to display the dependence 
on € explicitly if we follow the ordering in (11.97). 


Nonuniform Media. 


A more interesting case, similar to the preceding one, arises when the 
modulations are produced by slow variations in the medium. For example, 
we might consider an initially uniform wavetrain entering a nonuniform 
medium in which the parameters of the medium change slowly over a 
length scale L. If A is a typical wavelength (say the value in the initial 
wavetrain) the small parameter is e=A/L. In normalized variables, the 
medium will be described by functions of ex, and the form in (11.103) is 
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appropriate to describe the modulated wavetrain. A similar formulation 
applies if the medium changes slowly in time. 
As an illustrative example, we again take the Klein-Gordon equation. 
In a nonuniform medium the equation would usually arise in the self- 
adjoint form 
ap 8 


0 
ἐς ὃ [arnt Ze} +B%xN~=0. (11.104) 


We suppose that x,f have already been normalized with respect to a 
typical wavelength and period (typical values of af τ᾿ and B~' could be 
used), and to include space and time variations in the same analysis we 
suppose that 


a=a(ex,et), βεβίει,, ει). (11.105) 


As before, we do not introduce the dependence on ε explicitly, but work 
directly with (11.95) and (11.104) with the understanding that 


k=6,, w= — 9, A» a, B 


are all O(1) quantities and any increase of derivative or increase in the 
subscript of A increases the order by one. The resulting hierarchy begins 
with 


w? — a2k? — B*=0, (11.106) 
2wA o, + 2a7kAg, + (w, + 07k, + 2kaa,)Ay=9, (11.107) 
and we add the consistency relation 
0k , OW _ 
=e ae τ =(. (11.108) 


The determination w, k, and @ from (11.106) and (11.108) is exactly the 
procedure proposed on more intuitive grounds in Section 11.5, and sub- 
sequently obtained from the variational approach in Section 11.7. The 
consequences were already examined in Section 11.5. In particular we 
noted that values of k propagate with the group velocity dw/dk obtained 
from (11.106), but neither the group velocity nor the value of k need 
remain constant on a group line. 

In this case the group velocity may be written a*k/w, so that the 
characteristics for (11.107) are the same as for (11.108) and the values of 
A, may be obtained in principle by integration along these characteristics. 
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The main point to be noted, however, is that (11.107) may still be written 
in conservation form as 


(524048) + (50449) =0. (11.109) 
t x 


That 15, the wave action equation (11.102) still holds in the nonuniform 
case in which a and the relation between w and k depend on x and ¢. This 
substantiates the claim made in the variational. approach. 

Indeed if we form 


96 98 
ay Ox 


with the energy density and flux from (11.53)(1 1.54) and calculate it from 
(11.106)-(11.108), we have 


: [ ων a°k?+ B)AoA§ | + = [ Σαδωκ! = 
2 Ὁ: 
{eee + aa? , 36" “ar edt (11.110) 
This checks with (11.94), since 
c= + (w?— a7k? — B?)a?. 


The direct use of (11.95) in the equations leads to the required results 
but without the generality and insight of the variational approach. The two 
will be combined in the discussion of Chapter 14. We first consider 
applications of the theory so far, and amplify the ideas on specific 
problems. 


CHAPTER 12 


Wave Patterns 


Some of the most interesting wave patterns are found in water waves. 
Some of them, such as the V-shaped ship wave pattern or the pattern of 
rings spreading out from a stone thrown in a pond, are familiar to 
everyone, and others are relatively easily observed. We start with these. 
Here the dispersion relation, the only input required, will be merely 
quoted. We shall need to look further into the subject of water waves later, 
since it was the first and most fruitful source of ideas on nonlinear 
dispersive waves. The derivation of the dispersion relation will be included 
then. 


12.1 The Dispersion Relation for Water Waves 


In still water elementary solutions for the perturbation ἡ in the height 
of the surface take the basic form (11.1): 


ik-x — iwt 
3 


n= Ae 
provided 


T 


w?=( gk tanhkh (1+ — 
(g ) π᾿ 


Ἢ Κ- Κι. (12.1) 
Here h is the undisturbed depth, g is the gravitational acceleration, p is the 
density, T is the surface tension. In still water the waves are isotropic and 
the dispersion involves only the magnitude k of the wave number vector. 
There are a number of interesting limits which are conveniently used as 
approximations in appropriate circumstances. 


Gravity Waves. 


In c.g.s. units, g=981, p=1, and T=74, so that h,=20(T/pg)'/? = 
1.73cm. Thus the surface tension effects become negligible for wavelengths 
403 
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several times greater than this value. Then we have the usual formula for 
gravity waves: 


w? = gk tanhkh, A>A,,- (12.2) 


For these, the phase and group velocities are 


e(k) =(Ztanhah) (12.3) 


axe ) (12.4) 


Τὶ )( fae Le 
dk 2 sinh 2kh 


Within this approximation, we have the limiting cases 


1/2 1/2 
ees Ὁ ἐ- [5] | c~3(2] » sey 1128) 


w~(gh)"k, c~(gh)'?, C~(gh)'”, kh-0. (12.6) 
For fixed A, both c and C are increasing functions of \=27 /k, with 6 «ς; 
in the long wave limit (12.6), C->c and the dispersive effects become small. 


Of course, the approximation (12.5) is appropriate for short waves when 
AnKAKA. 


Capillary Waves. 


For AA,,, the surface tension effect may be dominant and (12.1) is 
then approximated by 


w= “tanh kh, (12.7) 
In this case 


ΠΝ (12.8) 


τῇ 2kh 
Clk) = Fe(1+ 7 |. (12.9) 


Sec 12.1 THE DISPERSION RELATION FOR WATER WAVES 405 


The further limits of these are 


1/2 
(s)"en 3) 
p p 


1/2 1/2 
Kh/2 c~3(7) Ee Phew, 


ρ 
(12.10) 
and 
1/2 1/2 1/2 
~(7) K2, ~(2) k, c~2 7) κ2, kh—0. 
p p p 
(12.11) 


For capillary waves, c and C are decreasing functions of A, with C >c. 


Combined Gravity and Surface Tension Effects. 


When both effects are important, it is usually sufficient to consider 
relatively short waves, kh>1, and take 


w?=gk+ 7m. (12.12) 


The phase and group velocities are 


g Τ' 1/2 
ἐπ [ξ Ὁ 1 ; (12.13) 
1+ (3T/pg)k? 
ΠΕ Vere Cee (12.14) 


21+ (T/og)k? 


_ The phase velocity has a minimum at k=k,,, where 


1/2 
= κα = 27 W : 
kn = 3 | λ, a 1.73cm; (12.15) 


the corresponding values of c and C are equal with 


Cm = 23.2 cm/sec. 


For A>A,,, often known as the gravity branch, C<c; whereas for \<A,,, 
known as the capillary branch, C >c. For any given value of c>c,,, there 
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are two possible wavelengths. The minimum group velocity is attained at 
A=2.54),, =4.39 cm, C=0.77c,, = 17.9 cm/sec. 


Shallow Water with Dispersion. 


In the limit kh-0, (12.1) may be expanded as 


wo? ghk? 1+(, — S)eaire | (12.16) 
pgh* 3 
and we have 
~—( eh) 2}, 4 41/7 _ 1)p320.... 
c~( gh) 6 {{{2, 5 ea | (12.17) 


When dispersion is neglected altogether, the equations for nonlinear shal- 
low water theory are hyperbolic and similar to those of gas dynamics; this 
so-called hydraulic analogy has been exploited for experiments. The dis- 
persion must be kept to a minimum so ἢ is chosen such that 


that is, 


Magnetohydrodynamic Effects. 


In a conducting liquid a third vertical restoring force may be intro- 
duced when a horizontal magnetic field is applied and horizontal currents 
flow through the liquid. This has been investigated by Shercliff (1969), who 
finds that the dispersion relation is 


pw7=k tanh kh(pg+ k?T + JB, " 


where B, is the magnetic field normal to the wave crests and J, the current 
along them. The term J,B, is the vertical component of the Lorentz force. 
It is interesting that the propagation depends on the orientation of the 
waves to the field and becomes anisotropic. Details of the phase and group 
velocities, and of the various limiting cases, are to be found in the paper 
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quoted. We shall not pursue this case further here, although the various 
wave patterns can be studied by the methods developed below. 


12.2 Dispersion from an Instantaneous Point Source 


The waves from a point source spread out isotropically and the 
different values of k introduced initially propagate out with the corres- 
ponding group velocities C(k). At time ¢ any particular value k will be 
found at r= C(k)t. Hence k(r,t) is the solution of 


C(k)=-. (12.18) 


For deep water gravity waves (12.5), we have therefore 


t 
k= τα ῳ 
r 


(12.19) 


πο 


This is the axisymmetric counterpart of the one dimensional problem 
noted in Section 11.4. This very simple formula for w has been checked by 
Snodgrass et al, (1966) against observational data of the swells produced 
by storms in the South Pacific. At distances of the order of 2000 miles, the 
frequency was found to vary linearly with ¢, and the constant of propor- 
tionality gave a very accurate determination of the distance of the storm. 

On a smaller scale, the typical rings spreading from a stone or other 
splash in a pond satisfy (12.18) with C(x) given by (12.14). Since C(k) has 
a minimum value of about 18 cm/sec, there is a quiescent circle of radius 
18¢ cm. Beyond that. there are two values of k for each r/t, one on the 
gravity branch and one on the capillary branch, so there are two super- 
imposed wavetrains. Of course, the energy in the different wave numbers 15 
determined by the initial disturbance. Waves with wavelength of the same 
order as the size of the object will have the largest amplitudes and will be 
most accentuated. For an object of dimension /, the main rings will be 
around r= C(a/J)t. 


12.3 Waves on a Steady Stream 


The waves produced by an obstacle on a steady stream U in the x, 
direction may be viewed as the waves produced by an obstacle moving 
with speed U in the negative x, direction. For a two dimensional obstacle, 
with flow independent of x,, the only waves that can keep up with the 
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obstacle and appear steady when viewed from the obstacle must satisfy 


c(k) =U. (12.20) 


We again take the situation when (12.12)-(12.14) apply. There will be no 
solutions to (12.20) and hence no steady wavetrain if U<c,,. In this case 
there will be local disturbances dying out away from the obstacle but no 
contribution to the asymptotic wave pattern. If U>c,, there will be two 
solutions of (12.20): one of them, k, say, on the gravity branch, and one, 
ky say, on the capillary branch. Now k,<k,, and ky >k,,; hence from the 


properties of (12.13)-(12.14), we have 


C(k,) <c(k,) = U, (12.21) 
C(kp) >c(kp) =U. (12.22) 


Therefore, the gravity waves k, have group velocity less than U and will 
appear behind the obstacle; the capillary waves Κα, have group velocity 
greater than U and will be ahead of the obstacle. The resulting pattern is 
shown in Fig. 12.1. 


IYO ON 


Ag 


——— 
U>Cpy, 


Fig. 12.1. Sketch of capillary waves (upstream) and gravity waves (downstream) produced 
by an obstacle on the surface of a stream. 


This is an interesting use of the group velocity concepts to determine 
the correct radiation condition in a steady flow problem. For this reason it 
is also interesting to derive the result in detail from the exact Fourier 
transform solution and see how the group velocity condition comes out of 
the usual type of radiation condition used in the techniques of solving 
boundary value problems. At the same time, the full solution gives the 
amplitudes of the waves. The asymptotic analysis 6115. 15 only that the 
amplitude remains constant in each wavetrain; the detailed initial condi- 
tions have to be analyzed to determine their values. It would interrupt the 
present discussion of kinematics to give the details here. They will be given 
in Section 13.9. 
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12.4 Ship Waves 


For an obstacle that is finite in the x, direction we have a two 
dimensional wave pattern on the surface of the water and the analysis is 
more complicated. We shall study only the gravity wave problem for deep 
water and use the dispersion relation (12.5). This covers the pattern 
produced by any object of dimension />A,, moving on water with depth 
h> I; this is the usual situation for ship waves. 

The most striking result, originally due to Kelvin, is that in deep water 
the waves are confined to a wedge shaped region of wedge semiangle 
sin~!4=19.5°. This result is independent of the velocity provided the 
velocity is constant; it is independent of the shape of the object, and it 
depends only on the fact that C/c= 4 for deep water. 


Fig. 12.2. Construction of wave elements in ship wave problem. 


A concise form of the argument is given by Lighthill (1957). Consider 
the “ship” to move from Q to P in Fig. 12.2, in time ¢, and let its speed be 
U. For a wave crest to keep a stationary position relative to the ship, 


Ucosy=c(k), (12.23) 


where Ψ is the inclination of the normal (direction of k) to the line of 
motion QP. This condition is most easily seen by taking the frame of 
reference in which the stream of velocity U flows past a stationary ship; 
the stream has component Ucosy normal to the wave element and this 
must be balanced by the phase velocity of the element. The condition tells 
us the value of k to be found in the direction y. It may be represented 
geometrically in Fig. 12.2 by constructing the semicircle with diameter PQ 
and noting that PO= Ut, SQ=Utcosy=ct. Therefore wave elements 
parallel to PS will have ct=QS. Now c is the phase speed and is 
appropriate for condition (12.23), but the group velocity C= 4c determines 
the location of these waves. The waves produced at Q will have traveled a 
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Fig. 12.3. Envelope of the disturbance emitted at successive times. 


distance Ct=4ct. Therefore in the direction y they will be found at T, the 
midpoint of QS. Including all values of ψ, we deduce that those waves 
produced at QO which can contribute to a stationary pattern lie on a circle 
of radius 1 Ut centered at R, where PR = 3 Ut. Finally, varying ¢t for a fixed 
point P, we have the pattern of circles of Fig. 12.3. From the construction 
in Fig. 12.2, each circle has a radius one third the distance of its center 
from P. Hence they fill a wedge-shaped region with semiangle sin~'4= 
19.5°. It is amusing to note that the construction in Fig. 12.3 is the same as 
for supersonic flow with Mach number 3; all swimming objects have 


effective Mach number 3. 


Further Details of the Pattern. 


In discussing the pattern in more detail, it is convenient to take the 
reference frame in which the source is fixed at P and there is a uniform 
stream U in the x, direction (see Fig. 12.4). This raises some general points 
about handling steady patterns which are also useful in other contexts. The 
dispersion relations in Section 12.1 apply to waves propagating into still 
water, but we may transfer to any other reference frame moving with 
relative velocity —U by noting that the frequency ὦ relative to the moving 
frame is given in terms of the frequency w, in the stationary frame by 


w=U-k+ (k). (12.24) 


This is the dispersion relation between w and k for waves superimposed on 
a stream U. Of course the propagation is no longer isotropic since the 
direction of U enters. For a steady wave pattern in this frame, w=0 and 
(12.24) becomes a relation between the components k, and k, of the wave 
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number vector k. With w,.(k)= V gk , we have 


G(k,,k,) = Uk, + Vgk =0. (12.25) 


Since cosy = —k,/k and c(k)= Vg/k , this is the same as (12.23). We 
may also write it using the polar coordinates (k,w) for k as 
G(k,W) = Ukcosw— Vgk =0. (12.26) 


Since the frequency ὦ is zero and Καὶ is independent of ¢, the kinematic 
description (11.43) reduces to the consistency relation 


ὌΣΤΕ ΕἾ. (12.27) 
0x, 0x, 


From (12.25), k,=f(k2), say, and (12.27) gives 
0k, 0k, 
ae, —f' (ky) ae 0. 


Hence k, and k, are constant on characteristics 


dx, 
ae —f'(k). 


For a point source P, the characteristics carrying disturbances pass 
through P and we have a centered wave 


xX 


2 -- - Ἐἐ! . 
a f'(k,); (12.28) 


this gives k, as a function of x,/x,, and k,=f(k,) completes the solution 
for k. 

The basic relation (12.28) can be written symmetrically in k, and k,. If 
k,=f(k,) satisfies (12.25) identically, 


F'(ky)G, + σι, =0, 


and (12.28) may be written 


X> Gy (k,,k2) 


=: G(k,,k,)=090. 12.29 
πῆ — Otvkd (12.29) 
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These are to be solved to give k, and k, as functions of x. The distribution 
of k is sufficient to sketch out the pattern, but the phase 0(x,,x,) can also 
be deduced to give the equations of the crests. 

It might be noted that (12.29) is the limit of the nonstationary 
centered wave solution as w—0. For the centered wave solution in (11.49) 
we have 


'=C=-—, G(kw)=0. 


If we take the ratios of the first set to eliminate ¢ and G.,,, (12.29) follows as 
the limit ὠ-50. We can think of the disturbance propagating out with the 
group velocity C, even though its form is unchanging and there is no 
change in the appearance of the pattern. We may refer to the group 
velocity in this sense even though it is only its direction dG/dk, that 
appears in the formulas. 

A further remark is that polar coordinates are sometimes useful, as in 
(12.26). In polar coordinates the gradient 96 7,9 Κ has a component 0 § /dk 
in the direction of k and a component 9 9 /k dy perpendicular to k. Hence 
the angle u in Fig. 12.4 is given by 


_ 199 ὃψ 
tanu=7 - 9 ak (12.30) 
The content of (12.29) is then equivalent to 
f=a—-p-y, $(k,y)=0. (12.31) 


Equations 12.30 and 12.31 determine k and wy (and hence k) for the 
direction &. 
These formulations apply to any steady two dimensional pattern, and 


k aG/dk 
μ 


-------. 
υ Ρ ΧΙ 


Fig. 12.4. Geometry of wave crests in ship wave problem. 
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Fig. 12.5. A complete wave crest in ship wave problem. 


we now apply them to ship waves. Using (12.25) in (12.29), we have 


Κχ. 8 

ee ee. Uk, + Vgk =0. 
ἯΙ {7 k, § 
+ aK VK 


It is clearly more convenient to switch to the (k,y) description of Κα and 
reduce these to 


tany g 


tan £= --------, k= —_.. 12.32 
ξ Ι--ἰιαηῖψ U*cos*p ( ) 


If the approach via (12.30)(12.31) is preferred, we have 


tanu= —2tany, (12.33) 
and (12.32) follows. 

We may now sketch out a typical wave crest as Ψ varies. From (12.25) 
or (12.26), Κι <0 and cosy >0, so that only the range —7/2<y< a /2 is 
permissible. The pattern is clearly symmetrical and it is sufficient to take 
the range 0<W<2/2. From (12.32) we see that the values y—0 and 


Fig. 12.6. Final ship wave pattern. 
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Y-7/2 are both to be found on €=0, and there must be a maximum value 
of € in the range. It is easily verified that the maximum value is 


δὲ =tan~'!—!_ = 19,59 at ¥,=tan~!—L =35,3°, 
2.2 ν2 


This agrees with the wedge angle found earlier and shows that the wave 
pattern is confined to this wedge. At the maximum ~#7/2 therefore the 
wave crest can not turn back smoothly; there must be a cusp at y=y,, on 
the boundary of the wedge. We may then complete the shape as given in 
Fig. 12.5 and the whole pattern must appear as in Fig. 12.6. 

The formula for the phase function (x) may be found from 


xX 
θ-- [ κ΄ ἄχ, (12.34) 
0 
. using any convenient path since k is irrotational. Obviously the rays 
€=constant are convenient since k remains constant on them. We have 
6=(kcosp)r, (12.35) 


where r=|x| is distance from the origin. Here k and p are functions of € 
given by (12.32) and (12.33). A phase curve ?=constant is given parametri- 
cally in terms of by 


ee --% 
Sor cos*y{1+4 tan? yy 


tany 


tan § = ——_—___- , 
: 1+2tan*y 


and @ is negative. These may also be written 


x,=- OB cosy(1 + sin’), 
(12.36) 


2 
Xy=- cos” siny. 


12.5 Capillary Waves on Thin Sheets 


One can study steady patterns of capillary waves in similar fashion, 
and a particularly interesting setting is Taylor’s study (1959) of waves on 
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thin sheets of water. Surface tension is the dominant effect and the sheet 15 
thin enough to make the approximation h/A<1 appropriate. In one mode 
the sheet deforms as a whole, keeping roughly constant thickness, and in 
this mode (antisymmetric disturbance of the two surfaces) the waves are 
not dispersive. For the symmetric mode, however, in which the two 
surfaces oscillate symmetrically away from the central plane, there is 
dispersion. We may apply (12.7) taking 2h to be the thickness of the 
undisturbed sheet, since the plane of symmetry is equivalent to a solid 
surface for each half of the sheet. Since the sheet is very thin the 
approximations (12.11) may be used. The wave pattern from a point source 
in a stream U may be analyzed from the general formulation in the last 
section using now the dispersion relation 


1/2 
G= Uk, +( 7) k2=0. (12.37) 


For a uniform sheet with uniform flow, U and h are constants. In that 
case, we immediately find from (12.29) that 


1/2 


X> 2ak, [3] (12.38) 


rhe U+2ak,’ a p 


From (12.37), k= Ucosw/a in the (k,w) description of k and the 
characteristic relation (12.38) reduces to 


--- thatis, €=7—2y; (12.39) 
an 


it follows from (12.31) that the angle μ is equal to y. This time, Ψ ranges 
from 0 to 7/2, ξ ranges from — 7 to 0, and we infer the roughly parabolic 
crests shown in Fig. 12.7. The phase function is 


οἱ 1/2 
#=(keeosy)r=( Th rsin’ > ; (12.40) 


successive crests are curves rsin?£/2=constant. 

Taylor also conducted experiments and developed the relevant theory 
for waves on a radially expanding sheet. In the undisturbed sheet the 
radial velocity, V say, may be taken constant since the pressure gradients 
are at most O(h) and may be neglected. As a consequence, the semithick- 
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y=n/2 
h 
γ "παν 
77 
ψτο \\ 
A 
y 9 /4Fh 
, 
y 2-1/2 


Fig. 12.7. Pattern of wave crests on a thin sheet of water. 


ness ἢ is a function of the distance R from the center of symmetry (the 
source of the flow) given by 


ἘΝῚ 
h= 5: (12.41) 


where Q is the total volume flow. Because h depends on R, we have an 
example of waves on a nonuniform medium. Well away from R=O, the 
medium is slowly varying relative to a typical wavelength and we may 
apply and illustrate the ideas of Sections 11.5 and 11.7 on nonuniform 
media. In polar coordinates (R,@) based on the source of flow, not on the 
source of the waves (see Fig. 12.8), 


k= (02.758), U=(V,0), 


and to adjust for the radial flow the dispersion relation (12.37) is modified 
to 


1/2 
Th(R ) 1 
γθ, τ 03+ 2562) =0 
R | ρ [ R R2 ῶ 


From (12.41), this dispersion relation may be written 


1/2 
V3 
G= 5 (02+ 1562) + BR, το, p= [55 | . (12.42) 
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choracteristic 


source source 
of flow of waves 


Fig. 12.8. Construction detail for waves on a radially expanding sheet. 


It may be solved by the method of characteristics, but the details are more 
complicated than in the corresponding steps (12.27)-(12.29) since k is not 
constant on the characteristics nor are the characteristics straight lines. 
However, we may find the characteristic form from the general formulas of 
Section 2.13. If we let p=0,, g=0,, we have 


aR _o= 2 Δ ς-.3 
dt C, aa dt G, R2’ 
dp qg | 

Ger Pa ἢ 

dq 


where 7 is a parameter on the characteristic. Since ᾧ 15 constant on each 
characteristic, it becomes a convenient characteristic variable and, from 
(12.42), 


2 
4 
7.) "ὦ (12.43) 
The characteristic curves are given by 
diy 47: q 


dR p+BR'” BR5/2(1 — q?/B?R3) 


These pass through the source, at R= Rp, ®=0, say, and the appropriate 
integral is 


3/2. Σ 

R sin(o — (3/2)@ 
(= J ates  Ὑ|ὁ} ee ΘΒ. (12.44) 
R sino BR3” 
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This equation for the characteristics may be solved to express gq as a 
function of R and ©, and then (12.43) used to obtain p. We have 


BR?/* sin(3/2)é 


{ (R/Ry)?—2(R/ Ro) cos(3/2)+1} 


BR'/?{ (R/ Ry)’ —cos(3/2)a} 


=6,= 
. {(R/Ry) —2(R/ Ry)” cos(3/2)a+ 1 


- BR ue 
72 
Ἢ 


and, finally, 


2 ΑΥ̓͂ (Ry? 3...) 2 (Ame ee 
= = BR3/?}| — }) --2.ὄ — ~H+1} +=BR3” ι-[] : 
θ 3 BRo ΕΣ] { 2) cos & | 3B 5 | R, 
(12.45) 


This result was first found by Ursell (1960b) in an amplification of Taylor’s 
arguments. For small ὥ, we have 


κι." -ὶ 
)-- ξραβλο»} ι- [59] | : (12.46) 


this agrees with Taylor’s equation for the crests and it compares well with 
the experiments. 

This particular case shows the power of the kinematic arguments, 
since any direct attack on the boundary value problem involved here 
would be a formidable undertaking. 


12.6 Waves in a Rotating Fluid 
For small perturbations in an incompressible fluid which has a basic 


flow velocity U along the x, axis and a solid body rotation with angular 
velocity 2 about that axis, the linearized equations are 


Du, δι, aP 
pre et op en ae 
Du, oP du, du, du, Στὴ 
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where 


oe δ Ae oy eee D_9%, 79 
P ᾿ 5 Ω (χ᾽ + x3), τὰ U Pe 


The velocity perturbations may be eliminated in favor of P and the single 
equation 


2 
2 
ὃ. y9_) y2p+4g29F 9 (12.47) 
dt 0x3 dx} 
obtained. 
When U=0O, the reduced equation for periodic disturbances 


P= Pe is 


2 2 2 3 
9729 oF 4(1- 2S - 


+ 
dx? 0x3 w? } ax? 


0. (12.48) 
The change of type from elliptic for w>2Q to hyperbolic for ὦ « 2 leads 
to both interesting phenomena and interesting mathematical problems. For 
w >2Q the disturbance from a source will have the typical 1/r? decay of a 
doublet solution of Laplace’s equation, whereas for w<2Q it will be 
confined inside the characteristic cone of semiangle tan~ '(4Q?/w*— 1)7'/? 
around the x, axis. For flow inside a container the boundary conditions 
are of elliptic type, which leads to unusual eigenvalue problems in the 
hyperbolic case w<2Q. Solutions have been found for special shapes 
(Greenspan, 1968; Barcilon, 1968; Franklin, 1972). 

When the stream U is included, the dispersion relation for (12.47) is 


(w — Uk) k?—407k2 =0. (12.49) 


Waves are possible only when (w— Uk)? « 422; for U=0 this checks with 
the condition for (12.48) to be hyperbolic. We have two modes satisfying 


20k, 
w= Uk,+ ΠΝ, (12.50) 
and the group velocity has components 
ng K1k3 — ag Kaks (kj + k3) 
Cy= Ἐ20 τς", (,-- Ἐ20 το, ΟΞ Ἐ20.---τς (12.51) 
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For a point source of constant frequency w on the x, axis, the distribution 
of k is determined from 


X3 C; 


= —____.. 12.52) | 
1/2 1/2 ( 
(xf + x3) (C?+ Cz) 
When U=0 this reduces to 
“3 aap (k?— K3)'”” 
(x24x3) Is 
1/2 
2 
-+(42-1] | (12.53) 
@ 


The disturbance is found on the characteristic cone in agreement with 
(12.48). 

When U0, there is dispersion even for fixed w and different values 
of k satisfying (12.49) are dispersed over different cones. Complete wave 
patterns can be worked out by the techniques developed here; the results 
can be found in the paper by Nigam and Nigam (1962). But perhaps the 
most interesting questions concern the wave propagation view of the 
Taylor column. 

In a famous experiment Taylor (1922) found that when a sphere is 
pulled slowly along the axis of rotation a whole cylindrical column of fluid 
circumscribing the sphere is pushed along with it. The complete analysis of 
the phenomenon is difficult (see Greenspan, 1968, p. 192), but some 
information can be obtained from the wave kinematics. We take the steady 
frame of reference with main stream U. For waves to appear upstream, 
they must have C,<0; hence 


20(k? + k? 
(i+) ς 
k3 


The most favorable case is for k,=0, which would correspond to the 
surface of the Taylor column. To be upstream the waves must have 
22/k>U, or equivalently, A> Um7/Q. We should expect the dominant 
wavelengths produced to have A=O(a), where a is the radius of the 
sphere. Indeed Taylor found a column when Qa/aU > 1 and this would fit - 
exactly with the choice A=a. Subsequent experiments and theory indicate 
that the transition is not sharp, and this result should be taken as an 
estimate of when the Taylor column will be reasonably strong. 
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12.7 Waves in Stratified Fluids 


Gravity waves in a density-stratified fluid are of great interest in 
meteorology and oceanography. The basic density gradient may be estab- 
lished by heating or salt content or other effects, but it is often desirable to 
eliminate compressibility and sound waves in the subsequent motion. To 
achieve this the continuity equation 15 split into two parts: 


“ΡΟ Yeas 

— = 3 “411 ΞΞ Ἂ 

δι 

and both are retained! The density is not constant but is assumed to be 
unchanged following a particle path in the wave motion. To these equa- 
tions are added the momentum equations 


Du_ io 
PH, =~ VP — 08: 


The double use of the continuity equation is in lieu of an energy equation 
and we have a complete system. The results and approximations can be 
checked against more complete descriptions, the main requirement being 
that the sound speed should be much greater than the wave speeds found 
in this theory. 

For two dimensional flow in the (x,y) plane with stratification in the y 
direction, we take the undisturbed distributions to be u=v=0, p=p,(y), 


P=p,(¥) with | 


AP 
dy + p92 =0, (12.54) 


and we linearize for small perturbations about these values. If the pertur- 
bations of p and p are denoted by p, and p,, respectively, the linearized 
equations are 


01, + VP9 = 9, u.+v,=0, 
Pot, + Pix =9, ρρῦ Ἔριν + 89, =9. 


A single equation can be deduced; in terms of a stream function V defined 
by v=. ὉΞ -ΦΨ'. {15 


ρο χε + (ρον, yt SPoY xx =0. (12.55) 


It is convenient for the. wave motion to have an equation with even 
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derivatives only, which may be achieved by the substitution ¥ =p, '/x to 
give 


2 ” / 
Po Po EPo 
Χχχε + Xyyu t (2, ae he Xxx = 9. (12.56) 


The special case of an exponential distribution, p»xe~%, has the attrac- 
tion of constant coefficients and can be matched reasonably well to other 
distributions. In that case the dispersion relation is . 


(K+ K+ 4a?) —agk?=0. (12.57) 


In many situations the interesting wavelengths are in the range k>>a, and 
(12.57) is approximated by 
27,2 , 
2 ὠρκὶ 2 EPo 
wo =>, wWo=ag=—-—. 12.58 
Kee Ke? OB og Ceo 


The frequency w, is the Vaisala-Brunt frequency; it is constant for the 
exponential distribution and would be a function of y in the more general 
case. 

We note that waves are possible only in the case w<w,, and the 
situation is somewhat similar to the example of rotating fluids. For a 
source with fixed frequency ὦ <p, one solution of (12.58) may be taken to 
be 


k,=kcosy, k,=-—ksiny, (12.59) 
where 


p=cos-'“; (12.60) 
Wo 


whatever the magnitude of k, the waves all have the fixed inclination w 
with the x axis. The corresponding group velocity has components 


ὡρκ2 Wok k, 

ον ae a (12.61) 
Since k-C=0, the phase and group velocities are perpendicular. Hence the 
group velocity is at an angle £=2/2—vy to the x axis. The direction of the 
group velocity determines where the waves will be found. In view of 
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(12.60), this direction is the same for all the waves and is at the angle 


ἔξ 5 —yssin7! (12.62) 


with the x axis. When all the possible signs of k, and k, are included, the 
disturbance forms an X-shaped pattern. The crests also lie on the X but 
move locally normal to the arms, continually dying out as they leave the 
pattern but being replenished by new ones appearing behind. (Of course 
each arm has a finite thickness in reality because the source is finite.) 

Excellent photographs were obtained by Mowbray and Rarity (1967a) 
and are reproduced in Fig. 12.9. The source is an oscillating cylinder 
normal to the plane of the photograph and oscillating horizontally; the 
vertical rod is a probe. The source introduced also a faint but discernible 
second harmonic with frequency 2w. This gives the pattern with angle 
sin” '2w/w,. In this paper and later ones (Rarity, 1967; Mowbray and 
Rarity, 1967b), these authors investigate the theoretical patterns in detail 
and also study the pattern produced by a moving sphere. In reality the 
distribution of density is not quite exponential and the bending of the 
group lines due to the dependence of ὡρ on y can be seen in some of the 
photographs. This illustrates the effects of a nonhomogeneous medium, 
and the variations could be analyzed by the kinematic methods developed 
in the last chapter and applied in Section 12.5. 


12.8 Crystal Optics 


In crystals the anisotropic properties of the medium produce striking 
effects in the wave patterns. The structure of the crystal produces direc- 
tional effects in the dielectric properties and the relation between the 
displacement vector D and the electric field E must be described by a 
tensor relation. The usual relation B=p,H suffices for the magnetic 
vectors. The effects are described by taking the constituitive relations 


D,=«¢,E, B= po; (12.63) 


yy? 


in Maxwell’s equations. In general the dielectric tensor ε; will depend on 
the frequency w. This may be accommodated by using the relations (12.63) 
only after the time dependence e~‘“’ has been taken out. For a plane wave 
with all components of the field vectors proportional to e*~“", Maxwell’s 
equations reduce to 
— iwB + ik x E=0, 
(12.64) 
iwD + ik x H=0. 


Fig. 12.9. (1) The image of the undisturbed fluid. (2) w/w g=0.318. (3) w/wg=0.366. 
(4) w/w9=0.419. (5) w/w =0.615. (6) w/wg=0.699. (Mowbray and Rarity, 1967a.) 
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Fig. 12.9. (Continued) (7) ὦ ωρΞ 0.900. (8) w/wa= 1.11. (Mowbray and Rarity, 1967a.) 


Since ΒΟΉ. it follows that k, D, and H are mutually orthogonal, so that 
D and H are transverse to the direction of propagation. Since E is 
orthogonal to B, it lies in the same plane as D and k but is not transverse 
to the direction of propagation in general. When B and H are eliminated 
from (12.64), we have 

«7tgD + k X (k X E)=0. 
On substitution for D in terms of E, this gives 


ωὗἶμρεῃ E, + k,k,E;— k*E,=0. 


The dispersion relation then follows from the condition that the 
determinant 


Ο(ω,Κ) =|w7y9¢), + kk, — k78,| =0. (12.65) 
0} J " 


In pursuing details of the wave patterns it is convenient to choose 
coordinates along the principal axes of ¢,. If the principal values are 
€,.€,€3, (12.65) may be expanded to 


+ wpgk? { εἰκ ξ + € kj + ε3Κ3} =(). (12.66) 
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9 b 


Fig. 12.10. (a) Dispersion surface in k space. (b) Phase surface in x space. 


For a source of fixed frequency w, (12.66) describes the surface in Κα 
space which determines the possible wave numbers k of the wave elements 
produced. For an admissible value of k, the corresponding group velocity 
15 

Gy 


ο()Ξ- τ: (12.67) 


ω 


it is in the direction of the normal to the surface (12.66), as shown in Fig. 
12.10a. This geometrical correspondence between C and k is useful in 
determining wave patterns. 

A dual surface is also useful. It may be constructed in terms of the 
phase surfaces produced by a periodic point source at the origin. This 
particular problem is not the one of most interest in crystal optics, since 
one would not normally envisage a source imbedded in the crystal, but it is 
a convenient route to the construction and the analysis applies to 
anisotropic waves generally. 

Wave elements with wave number k are found in the direction C(k) 
from the source. Therefore in each direction C from the source we can 
determine the corresponding value of k for that direction (see Fig. 12.106). 
But the phase surfaces generally do not move out on the group lines with 
speed C. The phase velocity has magnitude w/k in the direction k. Hence 
the point of intersection between a phase surface and a group line moves 
out with speed w/(kcosu), where μ is the angle between C and k. The 
phase surface leaving the origin at time ¢=0 will be at 


x= =t= "Cr (12.68) 


at time ¢. Varying k over all values satisfying (12.66) gives the phase 
surface. 
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An alternative derivation is to note that the phase function θΘ(Χ, ἢ) is 
given by 


O(x,1)= των + f "k-dx, (12.69) 
0 
and the integral can be taken along the group line to x. Therefore 
6= —wt+kcosp|x|. (12.70) 


The phase surface @=0, which left the origin at t=0, will be a distance 


ω 
= t 
" kK cos pt 


out along the group line with direction C. Hence (12.68) follows. 
In optics it is usual to introduce the ray vector s defined by 


᾿ς ω 
s= ΠΕ , (12.71) 
where Cc, is the speed of light in vacuo. Then the phase surfaces are given 
by 
X=SCof. (12.72) 


Thus 5 is proportional to the group velocity but 15 reduced in magnitude to 
give the phase propagation along the ray (group line) as a fraction of Cp. 
Since s is a function of k and conversely, the dispersion relation (12.66) 
may be used to find the corresponding surface in s space. Since this s 
surface is the canonical shape for the phase surface, the normal at any 
point is in the direction of k. Thus, using s in place of C, we have dual 
properties between the k and s surfaces. In optics it 1s also usual to work 
with the refractive index n=c,k/w in place of k. Then we have in addition 
s-n=1. For any point on the n surface, the corresponding 5 has the 
direction of the normal and its magnitude is the inverse of the perpendicu- 
lar distance from the origin to the tangent plane at the point. Conversely, 
on the dual s surface, the corresponding n is in the direction of the normal 
and its magnitude is the inverse of the perpendicular distance to the 
tangent plane. 

In the special case when the dispersion relation is homogeneous in 
w,k,,k,,k3, the dispersion function has the property 


G(pw, pk, pk,, pk) =v 0 


for arbitrary p. Hence differentiating with respect to p and then setting 
p=1, we have 
wG,, + k;G, =0. 
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Therefore k-C=w. In this case, but only in this case, (12.68) reduces to 


x=Ct, 


and the phase surfaces move out along the group lines with speed C. The 
difference between the group and phase velocities is just compensated for 
by the inclination factor cosy. This special case applies to (12.66), for 
example, if the ες, are taken to be independent of w. 


Uniaxial Crystals. 
In the case of a uniaxial crystal with symmetry about the x, direction, 


we have e,=e, and the common value will be denoted by ε,. The 
dispersion relation (12.66) then factors to give the two possibilities 


w? = (12.73) 


ie 1 k3 +k? 


€1 Ko €; Uo 


(12.74) 


One would expect anisotropy to distort the waves, but it is perhaps 
unexpected that splitting would occur and one family remain isotropic. 
The interesting phenomena in crystal optics stem primarily from this 
splitting. 

We now have two surfaces in k space as shown in Fig. 12.114. The 
waves described by (12.73) are isotropic with speed (€, i) '/? and they 
are called ordinary waves. The surface in k space is a sphere, the group 
velocity is parallel to k, and dispersion arises only if «, depends on w. The 
other family (12.74) is suitably called extraordinary and the surface in k 
space is an ellipsoid; these waves are dispersive. The surfaces are shown in 
Fig. 12.1la for the case e, ει. There are two group velocities Cy and C, 
for each k. As a consequence there will be two phase surfaces as shown in 
Fig. 12.11. For the ordinary waves we have C,xk and the phase equation 
(12.68) is 


ree er | (12.75) 


εἰ Moe 


Using (12.73) to eliminate k, we have the phase surfaces 


Eo X24 xG+ x2) = εἶ; (12.76) 
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a b 


Fig. 12.11. (a) Dispersion surfaces in k space for a uniaxial crystal. (b) Phase surfaces in x 
space for a uniaxial crystal. 


these are just ordinary spherical waves with speed (€, μο) '/*. For the 
extraordinary waves, 
ke Rs. Κα 
co 23] (12.77) 
€, © εἰ 
and the phase equation (12.68) is 


k k k 
x-| a ah, (12.78) 


> 5 
ξ, βρῶ ἐγ μρῶ ἐξ μρὼ 


From (12.74), the phase surfaces are ellipses 
εἰ μοχί + €; Mo x3 + x3) = 0? (12.79) 


The canonical s surfaces are obtained from (12.76) and (12.79) with 
X=SCof. 

For the extraordinary waves the direction of propagation for waves of 
wave number k is given by (12.77). Waves with k at an angle y to the axis 
of symmetry propagate at an angle € given by 


tant= τ tany. (12.80) 
1 
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LF. 


Fig. 12.12. Dispersion surfaces in k space for a biaxial crystal. 


ky 


As a consequence of the splitting, a beam incident on a uniaxial 
crystal will usually be refracted into two separate beams. The refracted 
beams are determined by the continuity of the tangential component k, of 
k. But for the given incident k, there will be two possible wave vectors k 
that satisfy (12.73) and (12.74), respectively. The refracted beams travel in 
the directions of the corresponding group velocities. 


Biaxial Crystals. 


For biaxial crystals with €,,€,€, unequal, the surface (12.66) consists 
of two sheets with four isolated points of intersection instead of the circle 
of contact between the sphere and ellipsoid of Fig. 12.11a. One octant is 
indicated in Fig. 12.12 for the case εἰ <€,<e,. The point P is one point of 
intersection and there are three others symmetrically placed in the other 
quadrants of the k,,k, plane. At a singular point the normal can take any 
value lying on a cone of directions at the point. If a beam of light enters 
the crystal normally in this direction, a cone of refracted rays is produced. 

Further details of this become complicated and require a lengthy 
account. These and other questions may be pursued in the excellent 
accounts by Sommerfeld (1954, Chapter 4) and Landau-Lifshitz (1960a, 
Chapter 11.) 


CHAPTER 13 


Water Waves 


Many of the general ideas about dispersive waves originated in the 
problems of water waves. This is a fascinating subject because the 
phenomena are familiar and the mathematical problems are various. We 
now turn explicitly to this topic. First we substantiate results referred to 
earlier, amplify specific details, and include a few problems special to the 
subject. Then we take up the nonlinear theory which first provided some 
insight into the questions of how nonlinearity affects dispersive waves; this 
eventually led to a general point of view on such questions. It will serve the 
same purpose here, providing motivation for the general discussion and for 
the study of similar phenomena in different contexts. 


13.1 The Equations for Water Waves 


We consider an inviscid incompressible fluid (water) in a constant 
gravitational field. The space coordinates are denoted by (x), x,y) and the 
corresponding components of the velocity vector u by (u,,u,,v). The 
gravitational acceleration g is in the negative y direction. The inviscid 
equations are given in (6.49); we now assume in addition that the density p 
remains constant and that there is an external force F= —pgj, where j is 
the unit vector in the y direction. The equations are 


V-u=0, (13.1) 
Oe = Se +(u-V)u=— 2 Vp— gi. (13.2) 


In the main problems of water waves, the flow may be taken to be 
irrotational, curlu=0, and a velocity potential φ introduced with u=V@q. 
This may be argued, as usual, from the equation for the vorticity «=curlu. 
Equation 13.2 is first rewritten in the form 

du ᾿ 


] 
— -- 2 = — — — ἢ 
ry +¥(5w)+oxu ee gj. (13.3) 
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Then if the curl of this equation is taken to eliminate the pressure, we have 
Helmholtz’ equation 


τῷ 40x (wxu) =0. (13.4) 


Since V-u=0O, it may also be written 


Be = τῷ + (WV)e=(o-V)u. (13.5) 
Now w=0 is a possible solution, and the solution is unique provided, say, 
that all components of Vu are bounded. Therefore if w=0O initially, it 
remains so for all time. In water waves, typical problems concern propaga- 
tion into water at rest or through a uniform stream. In both cases w=0 
initially and the argument applies. We restrict the discussion to irrotational 
flows. 
When u=Vgq, (13.3) may be integrated to 


20 = B(1)-9,-5(V9)—y, (13.6) 
Po 

where B(t) is an arbitrary function, and py is an arbitrary constant 
separated from B(t) for convenience in applying the free surface condition. 
Clearly, B(t) can be absorbed into » by choosing a new potential q’= 
gy — { B(t)dt. Usually we assume this is done and take 


u=Vq, 
(13.7) 
| i: δ! 2 
BT Ne) ey 
From (13.1) the equation for φ is Laplace’s equation 
Vp =0. (13.8) 


When the solution of (13.8) has been found for the relevant boundary 
conditions, the interesting physical quantities u, p are given by (13.7). This 
sounds simple enough, and it appears to have little to do with waves since 
Laplace’s equation is involved. Both reactions are wrong because of the 
curious effects of the free surface conditions. 

We consider the case of a body of water with air above it (although 
clearly the interface could be between any two fluids) and let the interface 
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be described by 
f(x,» y,t) =9. (13.9) 


The interface is defined by the property that fluid does not cross it. Hence 
the velocity of the fluid normal to the interface must be equal to the 
velocity of the interface normal to itself. The normal velocity of a surface 
defined by (13.9) 15 


=! 
Vi Pe iy 
The normal velocity of the fluid is 
ως Ὑ uy f.,+ vf, 
Vie Hed, 
The condition that these be equal therefore 15 


Df 
Dp wet Mite, + Mate, + Oh = 0. (13.10) 


This shows that particles in the surface remain there, and the condition 15 
often introduced directly on these grounds. It is easy to have misgivings 
about the direct statement, however, and it seems preferable to derive it as 
above from the basic property of an interface. 

In working with the equations it is convenient to describe the surface 
by y=(x,,X>,t), and choose 


f(x), Xn¥t) = (X44, X20) —y 


in (13.10). This gives the boundary condition in the form 


Dn 
Dp μη, Ἐ Hate, = Ὁ. (13.11) 

Equation 13.10 or 13.11 is a kinematic condition on the boundary. 
There is also a dynamic condition. Since the interface has no mass, the 
forces in the fluids on the two sides must be equal. Hence if surface tension 
is neglected for the present, the pressure in the water and the pressure in 
the air must be equal at the surface. Any disturbance of the surface clearly 
implies some motion of the air. But the argument is made that the change 
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in the pressure in the air due to the motion is negligible, and the air 
pressure may be approximated by its undisturbed value. This is because 
the density of air is very small compared with that of water, and changes 
of pressure are of order pu’. The assumption can be confirmed in detail by 
including the motion of the air in typical examples (see Section 13.7). If the 
motion of the air is neglected on this basis, the second boundary condition 
becomes p = p, where p is the pressure in the water, given by (13.7), and Po 
is the constant value in the undisturbed air. The two boundary conditions 
at the free surface are then 


on y=7(X),X»,t). (13.12) 
φ, ὁ 5 (i+ H%,+%) +an=0. 


Usually one boundary condition is given for Laplace’s equation, but that is 
when the boundary is known. Two conditions are needed at a free surface 
because the surface position ἢ has to be determined as well as φ. 

On a solid fixed boundary, the normal velocity of the fluid must 
vanish, that is, n- Vp=0. In particular, if the bottom is y = — ho(x,, x), we 
have | 


Py + Px ox, + Px,Mox, = 0 on y= — ho x), Xz). (13.13) 


This is the special case of the interface condition (13.10) since we take 
F(X 1,X2,¥,t)=y + ho(x,,x). For a horizontal flat bottom hy is a constant 
and 


g?=0 on y=—hp. (13.14) 


13.2 Variational Formulation 


In view of the general use of variational principles introduced in 
Chapter 11, it is important to have a variational principle for water waves. 
This does not seem to have been noted explicitly until the relatively recent 
paper by Luke (1967). It is of course well known that Laplace’s equation 
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follows from 


δ[ { [ 7 (vo) φάτο, (13.15) 
but Luke points out that the variational principle 
ὃ { { Ldxdt=0, (13.16) 
R 
‘ I | 2 
ὑπ τ [ {o.+ 5 σφ): τῳ} dy (13.17) 
—ho 


also gives the all-important boundary conditions. Here R is an arbitrary 
region in the (x,f) space. When (13.17) is substituted in (13.16) the 
integration is over the region R, of the (x,y,f) space consisting of points 
with (x,t) in R and —hy<y<vy. The extra terms g, and gy in (13.17), 
compared with Dirichlet’s principle (13.15), affect only the boundary 
conditions, since they may be integrated out and contribute only to terms 
from the boundary of Αἰ. 
For a small change dq in 9, 


-8f [3 ὰ- J LLL Cet Ve V8—) ay} ara 
R ~~ 0 
ἢ [ aie ee of" odes) dt 
= J f J Gans * φρ)δφῷ | dx 
R 


"3 SJ [(η, τῷ Px, 1x, a 9,)59],_, dx αἱ 


+ ff [(PeAox,+%)5],.-,, ἀκάϊ. (13.18) 
R 


(Repeated subscripts i are summed over i=1,2.) The first term integrates 
out to the boundaries of R and vanishes if 5g is chosen to vanish on the 
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boundaries of R. If (13.18) is to vanish for all such 89, it follows that 
φι + By =9, —hyo<y <0, 
ἡ, Ἐφ, ἡ, πφ,ξῦ, y=n, (13.19) 


φ, ἤρα, Ἂς φ- 0, ἘΞ: ho. 


The first is obtained by choosing d9=0 on y=y and y=-— ho, and 

applying the usual variational argument. Then, with the first two terms of 

(13.18) eliminated, an appropriate choice of d69>0 on y=y, dp=0 on 

y = — ho, gives the boundary condition on y=7; similarly the choice 6g =0 

on y=, Op >0 on y= — ho, gives the boundary condition on y = — ho. 
For a variation 6y in (13.16)-(13.17), it is immediate that 


᾿ 2 = 
δ ff Laxar=—pf f]o+3(Ve) ταν) δηάκαιτο, 
R 
and by the usual argument 
at 5 (Vo) +e =0 (13.20) 
Ba 


Equations 13.19-13.20 are the equations established in the previous sec- 
tion, and we see that this formulation is contained in (13.16)(13.17). 

The significance of (13.17) is that the quantity in braces is p — Po: the 
principle is one of stationary pressure! The relation of this to Hamilton’s 
principle is discussed in detail by Seliger and Whitham (1968). 


LINEAR THEORY 


13.3 The Linearized Formulation 


For small perturbations on water initially at rest, ἡ and ῳ are small 
and the equations may be linearized for a first investigation. The linearized 
free surface conditions (13.12) are 


= Py» φ, + gn =0, (13.21) 


Sec 13.4 LINEAR WAVES IN WATER OF CONSTANT DEPTH 437 

and we may linearize further by applying these conditions on y =0 rather 

than on y=7. After this further linearization, ἡ can be eliminated to give 
Py, + Bp, =9 on y=0. 


Laplace’s equation and the boundary condition (13.13) on the bottom are 
already linear and independent of ἡ. Thus we have the linear problem for 
@ alone: 


Px x, 7 Px x, + By =9; —hh<y <0, 


Gutsy, =9, y=, (13.22) 
Y, πᾶ ἤρχ, Px, τ hox,Px, = 0, aot ho. 


After the solution for g has been found, the surface is given from (13.21) 
by 


l 
M( XpXpt) = — φιίχι»Χ2»0,1). (13.23) 


The problem in (13.22) has to be supplemented by appropriate initial 
conditions. 


13.4 Linear Waves in Water of Constant Depth 


In the case of water waves, the waves propagate horizontally in that 
the elementary sinusoidal solutions take the form 


na Aen, g= V¥(yjen 


they are oscillatory in x,¢ but not in y. From Laplace’s equation, this form 
of is a solution provided 


1/2 
γ"- κὟ ΞΟ, «= [κ|ΞΞ (KT Ὁ Kz) . 


For water of constant depth Ay, the boundary condition on y= -- ἤρ 
requires Y’(y)=0. Hence 


Ya«coshk(hy+y). 
From (13.23), 


A=“ y(0 
ee 


438 WATER WAVES Chap. 13 


is the amplitude of ἡ, so we take 


ig coshk(hyt+y) 
ω cosh khy 


Then 


(13.24) 
ig coshk(h)t+y) 
ω cosh kh ᾿ 


ix-x— wt 


The remaining condition φ, + gp, =0 on y =0 gives the dispersion relation 


w? = gx tanhkhy. (13.25) 


It was noted in Section 11.1 that differential equations must lead to 
polynomial dispersion relations provided that the dependence is sinusoidal 
in all the independent variables. The transcendental equation (13.25) is 
obtained here because the variation in y is not sinusoidal. One might say 
that the waves are in (x,/) space and the y dependence provides a coupling 
between the wave motions at different depths. 


13.5 Initial Value Problem 


The dispersion relation (13.25) has two modes w= + W(«), where 
W(«)= V gxtanhkh, . (13.26) 


The possible branch point at «=0 is spurious, since gx tanhkhy~ghok’ as 


khy—0. The function W chosen to have W~kV gh, near the origin is 
single valued and analytic on the real x axis. It has branch points at the 
other zeros and infinities of tanhhy, at κἤρτε Ξὸ ππὶ, - (ἡ -- 1)πὶ, 
n=1,2,3.... The functions W(«) and — W(«) are both single valued 
analytic functions of κ in the complex « plane cut from -- οοἱ to — wi/2h, 


and from wi/2h, to coi. 
The general solution is obtained from the Fourier transforms corre - 


sponding to (13.24) with two modes corresponding to w= + W(«). Two 
initial conditions are needed to determine the arbitrary functions F(«) in 
the transforms. Of course any prescribed function ῳ must satisfy Laplace’s 
equation, otherwise compressibility effects will come into play and change 
the initial distribution rapidly to some new effective initial distribution. For 
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simplicity we take the case of fluid initially at rest with p=0. Then, from 
(13.21), n,=0 initially. To this we add a prescribed initial surface 


n(x,0) Ξε ηο(Χ). t=0. (13.27) 


For this problem the solution is 
00 00 
n(xt) =; F(«)e™*™-™ det f F(«)e***™ de, (13.28) 


where F(«) is the Fourier transform of 47 9(x). 
| For the one dimensional problem, « and x are scalars in (13.28), and 


οο 


F(n)= qo [πο α)ετ "τάς (13.29) 


The general solution can be reconstructed from the special case (x) 
= §(x), F(«)=1/47, which is famous in water waves as the Cauchy- 
Poisson problem. Its solution can be put in the form 


n(x,t)= 7 J cosxcos W(«)tdk (13.30) 


noted in (11.19). 
For axial symmetry about a vertical line the two dimensional form of 


(13.28) can be reduced to 


co fla ; 
(r,t) =2 [ J κΕ(κ) οἰ" οοδέρος W()t dk ἀξ, 


where r=|x|, «=|«|, and & is the angle between x and κ. An integral 
representation for the Bessel function Jo is 


a 1 τὰ ἱκγοοβξ 
Jo( nr) = 5 einrcost de. 
and the solution may be written 
io 2) 
n(r,t)=4a [ kF(k)Jo(xr) cos W(k)t dk. (13.31) 
0 
The inverse formula may be written similarly as 


F(x)= τ: [ yng rol Kr) αν. (13.32) 
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Of course these may be obtained also by separation of variables in polar 
coordinates and Fourier-Bessel transforms. For a 6 function initial condi- 
tion 4,(7) = 6(r)/2ar and the solution is 


nCni\= iq J Solar) cos W(x) tdk. (13.33) 


The asymptotic results of Chapter 11 may be applied to these solu- 
tions. In particular, from (11.24)-(11.25), the as ol behavior for the 
one dimensional solution is 


27 : ᾿ πὶ x 
~29( FO ra exp | ikx iW(k)t+ 4 }} oo, — >0, 


(13.34) 


where k(x, ἢ is the positive root of 


Ww’ (k)==, (13.35) 
and F(k) is given by (13.29). The interpretation was discussed in detail in 
Chapter 11, and the properties of the group velocity C(x) were discussed in 
(12.4)-(12.6). Since C(k) is a decreasing function of k, the longest waves 
appear at the head of the disturbance and are followed by successively 
shorter waves. The group lines of constant A and the phase lines of 
constant @ are indicated in Fig. 13.1; a typical wavetrain is shown in Fig. 
13.2. 


x 


Fig. 13.1. Group lines (solid lines) and phase lines (dashed lines) for water waves. 


For finite depth there is a finite maximum group velocity V gh, for 


kh 0, so that the head of the disturbance moves with velocity V gh, . 
This is a sharp wavefront only in the approximation (13.34), not in the full 
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Fig. 13.2. Wavetrain near the front of a disturbance in water waves. 


solution. In the full solution the disturbance falls off exponentially, without 
oscillation, ahead of this front and the disturbance 15 relatively small. Since 


C(k)= W'(k)> V gh, and W"(k)>0 as khy—0, (13.34) is not valid in the 
neighborhood of the transition region. We now investigate the true be- 
havior. 


13.6 Behavior Near the Front of the Wavetrain 


Exactly on the line x= V gh, t, the correct asymptotic behavior can 
be found from the extended form of the stationary phase argument in 
(11.26), since we have W’’(0)40. If F(0) is finite and nonzero, this gives 
an amplitude decay 


gat 1/3, (13.36) 


~1/2 


to replace the decay nxt away from the front. Since 


F(0) = 7 J nol) a, (13.37) 


this applies when the total initial elevation is finite and nonzero. 

However, we would like to have a uniformly valid solution through 
the whole transition region. It may be obtained by noting that the entire 
transition region corresponds to small values of k. Both (13.34) for small k 
and (13.36) for k=0 may be included by expanding the exponent in the 
Fourier transform about x =0, rather than the stationary point, and retain- 
ing up to third powers of x, From (13.26), 


W(«)~eok — γκ᾽ ἘΠ... (13.38) 
where 


co= νεῖ, γπτλξνίμο. (13.39) 
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Therefore, near the head of the wave moving to the right, we take 
ee] 
n~ [ F(«) exp { i (x — cot) + iyn*t} dk. (13.40) 
— 00 


It is also consistent to expand F(«) in its Taylor series and retain only the 
first term. If {© ηρί(Χ) ἄχ is finite and chosen to be equal to one, the first 
term is F(0)=1/47 and the solution is 


lf? ; 
n~y= go J exp{i(x— cot) + ive") dee (13.41) 


This may be expressed in terms of the standard Airy integral 


as)=a [ exp{i(se+ 55°) | -if cos (sz +55?) ds, 
“ae 0 


by a change of variable s =(3y/)'/*x. Then we have 
Y= ani Pe | (13.42) 
2(3ye)'" ἰ(35γὴ" 


The Airy function Ai(z) has the general form shown in Fig. 13.2. Its 
asymptotic behavior is 


2νπ 
Ai(z)~ 
|] Se lel/4sin( $ [2P/2+ τ) peer 
| a 3 4 


From these, we see that ἡ, decays exponentially ahead of x=cot and 
becomes oscillatory behind. Exactly on x=cot, n,<7¢7'/? in agreement 
with (13.36). The transition region is of width proportional to (yt)'/? about 
X=Cot. Away from the transition region, as (x — ¢ot)/(yt)!/73 — &, 


3/2 

~ =1/4. 2 (¢ot— x) 7 
ne~ (4a)? {3 yt(egt—x)} 0 “ἴβεις £2 ——_ +2). (13.43 
if { 0 } 3 (3yt)'/? 4 ) 


It may be verified that this merges correctly with (13.34)-(13.35). 
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If F(x)~ Fx” for integer n as x0, the solution (13.40) can be written 
in terms of n, by taking the appropriate number of x derivatives if n >0, or 
the appropriate number of integrals with respect to x if n<0. For example, 
the solution for the step function 


0, 0, 
No( x) = | ἘΠ 


Ι, χεθ, 
is just 
n~f{ n(x) dx 
Ι Ὁ Χ-- Col 
- --, [ Ai(s)ds, Ζ5Ξ ------τς. (13.44) 
a) ; (3y0)'" 


The Airy function has the property 


7 Ai(s)ds=1. 


— 0c 


The factor 4 appears in (13.42) and (13.44) because these represent only 
the waves moving to the right; those moving to the left complete the full 
initial condition. 

A simpler view of these solutions is to note that the dispersion relation 
(13.38) corresponds to the equation 


N+ Cos + YNxxx Ξ30. (13.45) 


We are solving this equation for ηρ(χ)Ξ %6(x) in (13.42) and for 7 
= 1 H(— x) in (13.44). The solutions are members of the family of similar- 
ity solutions 


SPO OS, ope OD 
n=(3yt) SnlzZ), aa (13.46) 


after substitution it is easy to relate f(z) to Airy’s equation 


ΑΙ" (2) =zAi(z) (13.47) 


and construct the solutions. 
Equation 13.45 is the linearized Korteweg-deVries equation, which 
will play an important role later. We might note that for any dispersion 
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relation with an expansion in the form (13.38), the long waves are de- 
scribed by (13.45) in the linear theory, and the solutions (13.42) and (13.44) 
apply. 

A restriction on the linear theory might also be noted. In (13.42) the 
amplitude of the first few crests decays proportional to ¢~!/7, whereas the 
dispersive effects (of relative order k”) decrease like ¢~?/3. Thus in the final 
decay nonlinear effects eventually become as important as the dispersion. 
Under appropriate conditions there is an intermediate range in which the 
asymptotic linear theory applies. The nonlinear effects require an extra 
term proportional to 7, in (13.45). The equation is then the full Korteweg- 
deVries equation and we shall see later that the decay is eventually halted 
and a series of solitary waves is formed. This nonuniform validity of the 
linear theory near the front of a wavetrain is similar in a general way to 
that discussed in Chapter 2 for hyperbolic equations. 


13.7 Waves on an Interface between Two Fluids 


The theory discussed so far ignores the changes in pressure above the 
water surface due to the motion of the air. We next confirm this in detail 
in a typical case. The argument can be usefully combined with a discussion 
of other effects on the interface between two fluids including the case of 
comparable densities. We consider a fluid with density p’ above one with 
density p, and, for simplicity, the fluids are infinitely deep. The flows are 
irrotational with velocity potentials φ΄, φ, respectively, and the interface is 
y =n. The pressures in the two fluids are 


ὔ , ᾿ Ι pee 
P’—Po=-?P [φ{Ὁ 5 (Ve') +a}, 


l 2 
P~Po= P| %,+ (V9) +o}, 


where py is the common undisturbed value, and the conditions on the free 
surface are 


Pp =P, 
1+ Pe Ne, + PxMx,~ P=9, 5 ony=n. 
ἡ, Ἔ Py Ne, Ἐφ, ἢ, — Py =O; 


It is interesting to consider perturbations to main streams U’,U in the two 
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fluids. If we consider only one dimensional waves and linearize the 
boundary conditions, we set 


p= U'x- SUA, p= Ux- 5 U%+®, 


and retain only the first order terms in ®’,®,y. The boundary conditions 
become 


ρ(Φ, + Ἦ υ'Φ, + gn) =p(®, + U®, + gn), 
n, + U'n, — ©, =0, ony=0. (13.48) 
n, + Un, — ®, =0, 


Since ®’,® satisfy Laplace’s equation and tend to zero as y>+ 0, 
y—-— ©, respectively, the elementary solution takes the form 


Φ' = B’ elle @ = Be iex— ed +9 n= Aele—o), 
The boundary conditions (13.48) then give the dispersion relation 


1/2 

+ / ἤ = 7 “ 
ae ΤΡ οι ee ν ΔΕ υ--υλ᾽. (13.49) 
ὰ A Kptp (p+o’) 


For the case U= U’=0, it is noted that 


-(+(83) 


In the limit p’/p—oo, this confirms the elementary solution neglecting 
motion of the air and provides the small correction. 

But it is also interesting that various instabilities are indicated when w 
has an imaginary part. The following cases may be singled out: 


1/2 


1. U=U’=0. This is unstable if p’>p, as would be expected. 

2. g=0,U#U’. This case is always unstable and is known as 
Helmholtz instability. 

3. p=p’,U#U’. This is the same as 2 above, since the gravitational 
effects can arise only for different densities. 

4. p#p’,U#U’. The solution is always unstable for sufficiently 
short waves, but this result would be modified by the stabilizing influence 
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of surface tension for extremely short waves. The effects of surface tension 
are easily incorporated, using the boundary condition given in (13.50). 


13.8 Surface Tension 


Surface tension acts like a stretched membrane on the surface of the 
water. For small deviations y=7(x,,x,,¢) from a plane surface the net 
effect is a normal force 7y,, per unit area. When this is included, the 
pressure condition at the surface becomes 


pt την = Po: (13.50) 


and the linearized conditions (13.21) are modified to 


: T 
m= φ,τβη- a on y=0. (13.51) 
The functional forms of ἢ and @ are the same as before, but the revised 
boundary conditions at the surface lead to the dispersion relation 


w? = gk tanh ho + x), (13.52) 


The properties and consequences of this were discussed and applied to 
some extent in Chapter 12. 

It was noted that the group velocity now has a minimum value at 
k=k,,. At the minimum W”(k,,)=0 and there is again a transition region 
in which (13.34) does not apply. The behavior of the solution in the 
transition region can be found by an approach similar to that in Section 
13.6. The expansion 


W= Wy) + (6 τὶ, ἀρ) τ τα τα, WK) το: 
is used in the Fourier transform solution, and the resulting form related to 
the Airy function. The details are given, for example, in Jeffreys and 
Jeffreys (1956, Section 17.09). 
13.9 Waves on a Steady Stream 


The geometry of various steady wave patterns was determined in 
Chapter 12. The variation of the amplitude along each group line can be 
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determined from the general group velocity concepts. However, as pointed 
out earlier, the initial distribution of amplitude among the different group 
lines can be obtained only from a more complete solution. We now study 
the Fourier transform solution for the uniform stream case, show how the 
simple kinematic description ties in with the full transform solution, and 
determine the amplitudes. We shall take the source to be an external 
steady pressure applied to the surface of the stream, rather than a pre- 
scribed displacement, since this more nearly represents the effect of ἃ 
floating body. 

When solving steady wave problems by transforms, care is needed in 
applying a suitable radiation condition to ensure uniqueness. It is interest- 
ing to see how the radiation condition in this context parallels the use of 
group velocity arguments in determining where the waves will appear. The 
nonuniqueness arises because the steady state is artificial in that no flow 
situation can have existed for all time. In principle, an ideal way to correct 
this is to solve a more realistic unsteady problem with suitable initial 
conditions applied at some finite time in the past, f= — 9, say, then let 
ty—>co in the solution. This program may be difficult to carry through, 
however, with a lot of unnecessary detail that does not figure in the final 
answer. A simpler version of the radiation condition follows this idea in 
principle and yet requires only a minimum extension of the problem. In 
the present case we take the applied pressure on y =0 to be 


πα = f(x,,x,)e", €>0. (13.53) 


It corresponds to a source that was effectively zero a long time ago and 
then grew gradually until it is close to f(x,,x,) at current times. This device 
has the desirable features of the initial value problem, having reasonable 
starting conditions, and yet keeps the time dependence simple. After the 
problem is solved we take the limit «0 to get the steady state solution. 

The boundary conditions (13.12) are modified by the inclusion of the 
applied pressure distribution, and the contribution of surface tension is 
included since it was seen in Section 12.3 to have particularly interesting 
consequences for the appearance of upstream waves. For small perturba- 
tions about a main stream U in the x, direction, the velocity potential is 
taken to be 


p= ὕχ,- 5 U%+® (13.54) 


where V® is small compared with U. The linearized boundary conditions 
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at the free surface are then 


ἡ, + Un, — ®, =9, 
7 ony=0. (13.55) 
®,+ U®,. + gn — ps Ξ —f(x)e* 


The perturbation potential ® still satisfies Laplace’s equation and, for 
simplicity, we take the case of infinite depth so that ᾧ-»0 as y—>— oo. The 
Fourier transform solution which satisfies the latter two, and which varies 
like ο΄, is 


oO . 
Dae“ f B(w)e™**%dk, «=k, 
οο 


n= ef A(«)e™*dk. 
The boundary conditions (13.55) then relate 4 and B to each other and to 
the Fourier transform Ε(κ) of f(x). The result is 


᾿ς KF (#«)e™*dk 
"θοῦ f Pei) A (13.56) 


Ad ᾿ 
os (κιῦ -- ie)” — w(x) 


where 
2 = gk + ” (13.57) 


is the dispersion relation for waves moving into water at rest. 
The role of ε can now be seen. For «=O there would be poles 
satisfying 


K7U? — we(k) =0 (13.58) 


on the path integration. The presumption would be that the path should go 
one way or the other around each pole, but the open choice leads to 
nonuniquesness. With e€>0, the roots of the denominator in (13.56) are 
complex; the poles have been pushed off the path and the ambiguity does 
not arise. Whether a pole contributes will then be determined by further 
deformations of the path in evaluating the integrals and will correspond to 
a particular choice of path in going around the pole. The values of κ 
satisfying (13.58) determine which values of « can make a strong contribu- 
tion; the position of the pole above or below the path will determine where 
the contribution appears. The condition (13.58) is essentially the condition 


Sec 13.9 WAVES ON A STEADY STREAM 449 


(12.23) [or (12.24) with w=0] that determines which waves can stand 
steady in the stream and so make a large contribution. The analysis 
showing whether the pole is to be included corresponds to the group 
velocity determination of where these waves appear. 


One Dimensional Gravity Waves 


For simplicity we consider first the one dimensional case and neglect 
surface tension. Then (13.56) becomes 


_ KF («ee Ε 
n(x,t) =e peat k=|K)}. (13.59) 


k,U— ie)” — gk 


The appearance of x =|k,| should be carefully noted; it stems from the fact 
that both positive and negative values of «, appear in the transform, and in 
the one dimensional case the exponential in the expression for ® must be 


exp(ik,x,+|K,|y) 
to ensure ®—>0 as y>— ©. 
In evaluating (13.59), we shall choose the special case of f(x,) 
= P§(x,), F(x,)= P/2m, since the more general case can always be recon- 
structed. Then we have 


en ke dey 
WX, = 
te (kU — ie) — a | 


Since k=|«,| appears in the integral, it is convenient to break the integral 
into the ranges x, Ὁ and x, <0, and use κ as the variable of integration in 


each. Then 
lo @) F 
πὰ ἢ ἘΞ Je ΠΡ Ke ἱκχιΩκ 
(κῦ-Ξ ie)” — gk ὃ (kU + ie)” -- gx 


Since e—>0 ultimately, the εἶ term in the denominators may be neglected 
and the e“ factor, which has now served its purpose, may be dropped. We 


have 
2 co ἰκχ 99 —iKx 
pra tim[ { ee — + fiery fh 30) 
P «».οι}} κυ-2ιεῦ-ε Jy κυλελιεῦ-- 


450 WATER WAVES Chap. 13 


The poles of the two integrands are at 


respectively, The paths can be rotated into either the positive or negative 
imaginary axis. For x, >0, the path of the first integral can be rotated into 
_ the positive imaginary axis and the path of the second into the negative 
imaginary axis; both poles contribute. We have 


U*n i co 4,7 mx, ᾽ οΟΟ, τ Ρῖχ 
e dm ..; i e idm . _; 
errs | eee a e dm _ ikx, 
0 0 


P 27 im—k 27 imtk 
= —2sinkx,++ Slt e™ idm x,>0 (13.61) 
, 7 Jo m+ k? | ᾿ ὶ 
where 
& 


For x,;<0, the paths may be rotated the other way and the poles do not 
contribute: 


U οο 
τὴ | Mm _omvim, χι <0. (13.63) 
P 0 


Relation 13.62 may be written 


υτ =c(k); 


it determines the wave number k of waves that can keep a steady position 
against the oncoming stream. Since sinkx, is a solution of the steady free 
surface problem for all x,, the radiation condition is crucial in showing 
that such standing waves appear only downstream (x,>0) of the applied 
pressure disturbance. The integrals in (13.61) and (13.63) are needed in the 
complete solution, but they become small for |x,|>>1. The asymptotic 
expansions of these integrals are found by formally expanding (m?+ k’)~! 
in ascending powers of m? and integrating the resulting series term by 
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term. This procedure gives 


a me~ ll ὦ Ι 3: ne) 
----- Qn~ ------- - -ο-- ea ar 
ὃ m2 + k? kx? ἀπ k®x° 


and it may be justified by Watson’s lemma. 
The conclusion is that away from the source there is a standing wave 
pattern downstream only given by 


2Ρ. 8 
LS eau k= 2" (13.64) 


One Dimensional Waves with Surface Tension. 


When surface tension is included, the expression corresponding to 
(13.60) 15 


ora) : ora) Lae 
2πὴ li e'™*1 ἀκ e "dk 
—— = lim SS -- ὁ Se  --ὐέεξεὺς 
Ρ ¢50 > KU —ieU—g—(T/p)k > «KU +2iceU—g—(T/p)k 


(13.65) 


The poles are close to the zeros of 


nicer ae (13.66) 
this is again the condition 
U=c(k) 


used in (12.20) to determine which waves can stand in the stream. 

When U<c,, where c,, is the minimum wave speed, the zeros of 
(13.66) are not real. Hence there are no poles close ἴο the real axis in 
(13.65), all the contributions to the integrals fall off rapidly with x,, and 
there is no standing wave pattern. This agrees with the conclusion drawn 
in Section 12.3. 

When U>c,, there are two real roots of (13.66), and they are the 
values k, and k, referred to in (12.21)(12.22), with Κα Κα. In this case, 
the integrands in (13.65) have poles close to k, and Κ7. They are located at 


2pU 2pU - 


=k,+ =k > 
Κ Τ (q-k)T 


———_— I€ 
ἢ (κ,- k,)T 
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for the first integral, and at the conjugate points for the second integral. 
For x,>0, the first integral may be rotated onto the positive imaginary 
AXIS pice up the gravity pole near k, but not the other. The second 
integral is rotated on to the negative imaginary axis and also picks up its 
gravity pole. Thus downstream of the source, the gravity waves appear. 
Similarly, for x; <0 the opposite rotations must be used and the capillary 
poles make the contribution. The conclusions agree with the group velocity 
arguments used in (12.21)-(12.22): gravity waves downstream and capillary 
waves upstream. The asymptotic wavetrains are found to be 


—2Pp_ 
——_-~-—— sink, x, x, >0, 


(kp — k,)T 
ae 

(kp—k,)T sink;x), x «0. 
T g 


Ship Waves. 


For the two dimensional problem of gravity waves produced by a 
om source f(x,,X,)= PO(x,,x,) we take F(xk)=P/47* in (13.56) and 
= ek. Then (13.56) reduces to 


= "ἢ ΕΞ eel (x, # waa) } ΚΕΣΡῚ {οι ΟΡ} aay (13.67) 
— ἰε7 u)’ — gx /U? 


where κα (κ2- x3)!/*. It is convenient to introduce polar coordinates 
x,=rcos&, X,=rsin§, 
K, = — KCOSX, kK, =k sin x. 


The contribution from the range 7/2<x<37/2 is the conjugate of the 
range —27/2<x<717/2, and in the limit e—0, (13.67) may be taken as 


2 21] -72 οο -δ 4 
GE ΒΤ ΗΝ ΣΟ ΟΝ. 0ῦϑ.. «ὅτ 9. (13.68) 
Ρ «0 π᾿ cos”x ὃ K— Ko 


where 


(13.69) 
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Since cosx >0, the pole x= kg is in the lower half of the complex « plane. 

The pattern is symmetrical about the x, axis, so it is sufficient to con- 
sider the range O<&< a. When cos(&+ x) >0, that is, —7/2<x<7/2-§, 
the path of the integral in the x plane can be rotated into the negative 
imaginary axis and the pole contributes; when cos(€+x)<0, that 15, 
a/2-&E<x< 7/2, it can be rotated into the positive imaginary axis and 
the pole does not contribute. 

The further details are quite extensive if one keeps careful track of all 
the terms. Here we note the contribution of the pole and gloss over the 
remainder. The contribution of the pole is 


gP [[ exp { -- ikgr cos (+x) } 
4 -- ς΄ σ-ἷἅὅ- π΄ - dx 


= 4 4 , 
aU 7/2 cos'x 


where the ε term in x, can now be dropped. The exponent 


gcos(€+ x) 
= k,cos (+ χ)Ξξ------- 
s(x) Ko (ξ x) U2 cos2x 
has a stationary point at x= where 
tan(é+ y)=2tany. (13.70) 


Hence by the standard stationary phase formula we have 


‘| (seen 


After some simplification, this reduces to 


1/2 
gP : 


" aU* 


exp | — ἰγε(ψ)-- 7 sens"(¥) | 


1/2 1/4 : 
22 Ρ (1+4tan*)’ . Ἢ Ι 
(Fe) Treacy Πππταιῖῃστ [θῶ νψν ΣΦ ns} 
(13.71) 
where 
5 
ΚΞ κκ(ψ) = Wee (13.72) 


and y(&) is determined from (13.70). 
The wave number determination of k and ψ from (13.70) and (13.72) 
agrees with (12.32); the phase krcos(€+ Ψ) agrees with (12.35). The ampli- 
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tude is singular at the maximum wedge angle €=£ =19.5° where tany 
=2~'/2. This is where the lateral and transverse crests meet at a cusp and 
corresponds in the analysis to the confluence of the two stationary points 
of (13.70). Since s”(~)->0, we have a transition region of the same general 
type as that in Section 13.6. At the wedge the phase jumps by 7/2. The 
singular behavior on the x, axis, where 0, is related to the assumption 
of a point disturbance. The singular regions are studied in detail by Ursell 
(1960a). 


NONLINEAR THEORY 


13.10 Shallow Water Theory; Long Waves 


For gravity waves with «kh )—0, the dispersion relation is approxi- 
mately | 


and the phase speed cg= V ghy becomes independent of x. The dispersive 
effects drop out in this limit and, in one space dimension, the Fourier 
superposition of solutions for ἢ leads to 


co ; oO : 
η- [ F( ex die + [ G(K) e+) de 
65 — 0 


= f(x—cot) +g(x+ Cf). 
This is the general solution of the linear wave equation 
The — CoN = 0. (13.74) 


Clearly there must be some direct way of extracting this equation from the 
full equations and, in fact, it was already obtained in a more general 
setting in Section 3.2. There, in the study of river flow, nonlinearity and 
friction were included. Here we neglect frictional effects but are concerned 
with nonlinearity. 

First we recall the previous derivation. The key step is to approximate 
the vertical component of the momentum equation (13.2) by 
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Then 
P—Po=0g(n-y). (13.75) 


The horizontal components of (13.2) become 


(13.76) 


[using now the mixed notation u=(wu),u5,v), X=(%1,X2,y) so that i=1,2 
and the summation is for /=1,2 in (13.76)]. Since the right hand side is 
independent of y, the rate of change of μ, following a particle is indepen- 
dent of y. Hence if u, is independent of y initially, it remains so. We 
consider this to be the case. Then (13.76) becomes 

Ou; Ou; θη 


δι Vax, 78 ae (13.77) 


Although the vertical acceleration was neglected in (13.75) compared with 
the terms retained, there is no reason to neglect dv/dy in (13.1). However, 
we may use the integrated form of (13.1), which must give the conservation 
equation 


oh 


art x, (7) = 0, (13.78) 


where 
h=hj)+7 


is the total depth from y = — hy at the bottom to y= ἡ at the top. In detail 


a an ah 
= og ni dy +[v],--_—[ u;|, - n Ox; a ey τς 5 


x; 


and from the boundary conditions (13.11) and (13.13), this reduces to 


ὃ ( on 
Ox; fue Ot 


since u, is independent of y in this approximation and »,=h,, (13.78) 
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follows. Equations 13.77 and 13.78 are the shallow water equations for 
n(x, ἢ), u(x,t). [Equations 3.37 follow by writing gdn/0x =g0h/0x —gS in 
(13.77) where S= 0h,/0x is the slope of the bottom, and adding a friction 
term]. 

It is easy to make an order of magnitude estimate of the approxima- 
tion. The error for p in (13.75) is of order phgv, and, from (13.1), vx -- 
hou,; hence the relative error in (13.77) is of order 

~ Px Agu er hg 


~~ 


m4? 
pu, u, 7 


where {15 a length scale for the waves in the x direction. This fits with the 
approximation (kA ))?<1 in obtaining (13.73). Thus (13.77)-(13.78) provide 
a nonlinear set for relatively shallow water or, equivalently, for relatively 
long waves. The effects of dispersion do not appear in this approximation. 
In the next section the shallow water equations will be derived as the first 
terms in expansions in (h,//)*; small dispersive effects will be included by 
going to the next order. 

The linearized equations lead to (13.74), but we may obtain nonlinear 
solutions using the theory of Part I, since the system is hyperbolic. In 
particular, for one dimensional waves on a horizontal bottom we may take 


h,+ (uh) .=0, 
13.79 
u, + uu, + gh, =0. 


The characteristic velocities are ut Veh , the Riemann invariants are 
ut 2V gh , and a simple wave moving to the right into water with h= hy is 
given by 
h=H(t), u=2VgH -2Vegh, 
(ξ) 8 870 (13.80) 
x=f+ {3 gH(£) —2V gh, "ἢ 
All such waves carrying an increase of elevation break. Then ἃ discontinu- 


ity may be fitted in, and the shock conditions, derived from the conserva- 
tion forms of (13.79), are 


Are | wh + γε} =0, 
(13.81) 


- U[A]+[uh]=0, 
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as noted in (3.53) and (3.54). This is the turbulent bore. 

This breaking phenomenon is one of the most intriguing long-standing 
problems of water wave theory. First, when gradients are no longer small 
the approximation hj //? is no longer valid, so the solution (13.80) should 
cease to apply long before breaking occurs. Yet breaking certainly does 
occur and in some circumstances does not seem to be too. far away from 
the description given by (13.80); moreover, bores, breakers, and hydraulic 
jumps are sometimes reasonably well described by (13.81). But the shallow 
water theory goes too far: it predicts that αἰ waves carrying an increase of 
elevation break. Observations have long since established that some waves 
do not break. So an invalid theory seems to be right sometimes and wrong 
sometimes! It is not hard to see how the neglected dispersion effects inhibit 
breaking, but in the simple theories including some of these effects (see the 
next section), they in turn go too far and show that no waves break! We 
postpone further discussion until the dispersion has been included. Before 
doing this, however, a few more details of shallow water theory will be 
noted. 


Dam Break Problem. 


First, a classic solution for the breaking of a dam does not involve 
bores (strangely enough) and is easily solved by simple wave theory. The 
problem is formulated with initial conditions 


u=Q, —-o<x< 0, 
h=0, 0<x< 0, at r=0. 
h=H,)>9, —-o<x<0, 


Fig. 13.3. Characteristic pattern in the dam-break problem. 
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Then, on any positive characteristic C, from the region h= Ηρ (see Fig. 
13.3), the Riemann invariant 


u+2V gh =2V gH, . (13.82) 


In the region covered by these characteristics, the solution is a simple wave 
on straight C_ characteristic through the origin. They are given by 


= =u-Vgh. (13.83) 


But (13.82) and (13.83) provide the whole solution: 


Veh =3(2V μῆς -) 
-ν;ῃ, <= <2V gH . (13.84) 
u= 3 (Vali +7), | 


The free surface is parabolic between the front h=0, x=21tV gH, , travel- 
ing with speed 2VgH,, and the undisturbed dam level h=H, at 


x=—-tVgH,. 

Again the shallow water theory is not strictly valid in the initial 
instants since the horizontal length scale / 1s small, but as the flow develops 
Hj /1* becomes small and the real flow is reasonably well described. It 


Should be noted that h=4H,/9, u=2\ gH, /3, remain constant at the 
dam position x=0 for all ¢>0. The speed of the front is modified 
considerably by friction; for attempts to estimate this see Dressler (1952) 
and Whitham (1955). 


Bore Conditions. 


The bore conditions (13.81) conserve mass and momentum across the 
bore and one naturally asks what happens to the energy. The conservation 
of energy for equations (13.79) 1s 


loa 1 ᾿ (5 3 ) Z 
(50h + 58h +(Zwh+ ugh?) πὸ (13.85) 


This would provide a third potential shock condition, but only two 
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conditions can be used with the system (13.79). Rayleigh proposed that in 
fact energy is not conserved across a bore, attributing the loss to the 
observed turbulence, so that the shock condition corresponding to (13.85) 
should not be used. It is easy to show that whereas (13.79) imply (13.85), 
the jump conditions (13.81) imply 


2 2 

| ih + wah] —U| 51h+ χει] <0 for ἱρά}. (13.86) 
Ι 1 

Since bores are required only when h,>h,, the Joss of energy agrees in 

sign. The energy plays a role similar to the entropy in gas dynamics; in gas 

dynamics any internal energy is included in the detailed description, so 

energy is conserved and the extra variable in the description allows an 


extra shock condition. Turbulent energy is not included in (13.85). 
A convenient form of the bore conditions (13.81) is 


1/2 

hh, +h 

ieee gh(h, + hy) (13.87) 
2h, 


(13.88) 


u,=u,+ 


1/2 
hy—h, | ghy(h, + hy) 
h, 2h, : 


Further Conservation Equations. 


It is interesting that the shallow water equations (13.79) admit an 
infinite number of conservation equations of the general form 


ὅρῳ, h)+ = O(u,h)=0. 
It is only necessary that 
QO,=uP,+hP,, O,=28P,,t+ uP,. 
Thus any solution of | 
ΞΡ = AP iy, 


will lead to a conservation equation. The most interesting are polynomial 
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in u and h. They may be obtained consistently by taking 


P= > p,(uyh™, 
m=0 
from which it follows that 


gp,,_,=m(m-— I|)p,,, m=2,-°-,N, 


Po=9, P= 
The first few are: 
Ρ Q 
u Su?+ gh 
h uh 
uh wht sgh 
1 2 1 2 1 3 2 
yu A+ τ 8} 5 ΜῈ ugh 
dos 2 Ἵν δι ee Gore 
3 wh + ugh 3 wht Zugh + aah 


The second, third, and fourth correspond to mass, momentum, and energy, 
respectively. The others have no obvious interpretation. However, since 
each can be used to give a constant integral 


[ { P(u,h) — P(0,h) } dx = ςοπβίαηί 


in any problem in which u-0, hh, at +o, an infinite number of 
integrals of the solution are known. One might therefore expect to be able 
to find the solution analytically. Indeed, by means of the hodograph 
transformation (13.79) can be converted into a linear equation and solved 
in principle; the analysis is identical to that of Section 6.12 with y =2. 


13.11 The Korteweg-deVries and Boussinesq Equations 


We next consider how dispersive effects may be incorporated into the 
shallow water theory. This may be done by setting up a more formal 
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expansion in the small parameter (h,//)? and including the next order 
terms beyond shallow water theory. Before doing this, however, it is 
advantageous for flexibility and insight to use a quicker and more intuitive 
procedure. We consider the case of one dimensional waves with hy con- 
stant. The linearized version of the equations we seek must give the 
dispersion relation (13.25) in the next approximation beyond (13.73): 


02 = 0242 — ὦ e2prect (13.89) 


3 


An equation for ἡ with this dispersion relation is 
Ϊ 
Ne — CONex ~~ Ἵ CON xxxx =0. (13.90) 


The shallow water equations (13.79) linearize to (13.74). If we can add an 
extra linear term to (13.79) so that they linearize to (13.90), we should have 
a system which includes both the nonlinear effects of relative order a/hy 
(where a is a typical amplitude) and the dispersive effects of relative order 
ho/l*. This is easily done. There are various forms equivalent in the 
desired approximation. If we choose to add a term ph,,.. in the second of 
(13.79), the linearized equations are 


ἡ, + Aout, = 9, 
u, + 81x τ VN xxx = 0, 
and u may be eliminated to give 
Ne =a CONex cae honxxxx = 0. 


Therefore we choose v= 4cZh, to agree with (13.90). The argument then is 
that the system 


h,+ (uh), =0, 
, (13.91) 
u, + uu, + gh, + 5 COMohexs = 0, 


reduces correctly to (13.90) in the limit a/h)—0, correctly to (13.79) in the 
limit hg//*->0, and so combines the first order corrections to (13.74) in 
both a/h, and in h¢/1?. 

One can always substitute the lowest approximation (13.74) into the 
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correction term. Hence an equivalent system to the order considered is 


h, + ( uh ). = 0. 
(13.92) 
u,+uu, Ἔφη, + sight, Ξε. 
This is the system favored by Boussinesq, who first formulated these 
equations. The linearized version of (13.92) leads to 


l 
Ui; an CON ex i 5 Ποῆχκε = 0, 


and the dispersion relation is 


22 
; Οὐκ 


ω [+ /3eR (13.93) 
The expansion of this dispersion relation for small («h,)* agrees with 
(13.89) to the first two terms; hence the two systems are equivalent in this 
approximation. However, if the equations are in fact used when h?//? is 
not small, (13.92) is superior to (13.91). According to (13.89) small pertur- 
bations with (kh,)*>3 will in fact amplify because w becomes imaginary, 
whereas (13.93) retains a real w even though it is inaccurate in the range 
concerned. In numerical work, the various effects of finite differencing and 
truncation introduce small oscillations of small wavelength even if the 
formulated analytic problem satisfies h2//*«1, so (13.92) is preferred. 

Boussinesq’s equations include waves moving to both left and right. 
By following the same steps restricted to waves moving to the right only, 
we obtained the Korteweg-deVries equation. For waves moving to the 
right the first two terms in the dispersion relation are 


W=CyokK-yK, y= 7 Cohe, (13.94) 
corresponding to the equation 
ῃ, Ἔ Cone + γη,χ, = 9. (13.95) 


In the nonlinear shallow water equations (13.79), waves moving to the right 
into undisturbed water of depth hy satisfy the Riemann invariant 


u=2Ve(hg+n) —2V eh, , (13.96) 
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and after substitution in either of the equations in (13.79) we have 

n+ {3Vg(ho+m) —2V eho }n,=0. (13.97) 
Combining (13.95) and (13.97), we have 


n+ { 3Vg(ho+ 7) —2V gho bn. + Yee =0- (13.98) 


If the nonlinear terms are approximated to the first order in a/hg, we take 
3 7 

n + co ἘΞ 7 |e YMxxx = 9. (13.99) 
0 


This is the Korteweg-deVries equation. There is no reason to believe that 
retaining (13.98) is preferable, since other terms, proportional to the 
product of a/h, and h?/I*, for example, may be as important as the 
nonlinear terms in a*/h2. Again one may use 7,~ — Con, in the dispersive 
correction term and adopt 


3 ἢ Y | 
UF + ed ΗΝ 2 ho hn Co Next 0. 


The linearized equation then has the dispersion relation 


Cok 


= εξιεξει 
1+7K7/Co 


This agrees with (13.94) for small κ, and, in contrast to (13.94), has 
bounded phase and group velocities if « becomes large. Since ὦ remains 
real in each case, the modification is less compelling than in the 
Boussinesq case but nevertheless is probably desirable for some purposes 
(Benjamin et al., 1972). However, many fascinating exact analytic solutions 
have been found for (13.99), and in general it may be transformed into a 
linear integral equation; at present these features dominate the other 
issues. 

The preceding derivations allow great flexibility and the approach 
naturally allows the various alternatives that were discussed. It also be- 
comes clear that the equations apply to many dispersive wave problems 
quite apart from water waves. Any dispersion relation with an odd 
function for w(k) may be expanded to the two terms in (13.89) or in 
(13.94), and then (13.90) and (13.95) cover the linearized theory. It is then 
only necessary to have some access to the form of the nonlinear terms and 
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those in (13.91) or (13.99) are rather typical. In this way the equations have 
arisen in plasma physics, for example. | 

Of course a confirming formal expansion is also desirable and inform- 
ative. If, temporarily, Y is measured from the horizontal bottom, we have 
to solve Laplace’s equation 


Dx. + Pyy =0 


with gy=0 on Y=0. The shallow water theory, with Q~, approximately 
independent of Y, and the small total depth both Suggest an expansion 


φ- : Y"f (x, ἡ). 


When this is substituted in Laplace’s equation and the boundary condition 
on Y=0 is used, we have 


co m 2m 
p= S\(-1)" ΩΝ μι (13.100) 
0 


(2m)! 0x2™’ 


where f= f,. The final step is to substitute this expansion into the boundary 
conditions on the free surface. Because they are nonlinear and applied on 
Y=h)+y, it is here that the analysis becomes a little involved and terms 
have to be ordered in expansions with respect to the two parameters 
a=a/ho, B=ho/1?. In carrying through the details, it is best to normalize 
the variables from the start by taking the original variables (primed) to be 


vel, Yehy, rat, 
(0 
᾿ , δίαφ 
ηξαη, φ'ξ : 
C9 


The different stretchings in Y and x introduce the crucial step. In the 
normalized variables, the problem is formulated as 


B9,,.+ Pyy =9, 0< Y¥<l1l+ay, 


py = 0, Y=0, 


1 
1, + αφ, ἡ, -- ras - 0, 


Y=I+an-: 


Lg te eet 
WG Ua τυ 
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The expansion for g now appears as an expansion in powers of β, but from 
Laplace’s equation and gy =0 on Y=0, we again have 


y2m ot gm 
p= er 1)" = ae 


On substitution in the surface conditions one finds 


n+ {Ut αἡ) ,.). - | τι + an) γα + +a( + an) Nec B+ O( B*)=0, 
qtf,+ Saf?— 5 (1+an) i OS a> | BE OUR) =0. 


If all terms in β are dropped and the second equation differentiated with 
respect to x, we have the nonlinear shallow water equations: 


n+ {(1+an)w },=0, 
w, + aww, + 7, =9, w=f,. 


If the terms in the first power of B are retained, but to simplify them terms 
of O(a) are dropped, we have 


nyt [(1Ἐ αη}»},-- 2 Bites + O( aB,B?) =0, 
(13.101) 


w, + aww, +n, Bw,,,,+ O(aB,B7)=0, w=f,. 


These are just a variant of Boussinesq’s equations. The quantity w is only 
the first term in the expansion of the velocity p,, which is 


y2 
Pq = W— BW + OC B?). 
The value averaged over the depth is 


Bw,,,+ O( a8, B*); 


= 
Ι 
= 


the inverse is 


ere + Bi. + O(a, 82). 


466 WATER WAVES Chap. 13 


If these are used in the equations, we have 


n+ {(i+an)i},+ O(aB, 87) =0, 
ii, + aii, +7, -- + βῆ, + O(aB, B?) =0. 


Finally, substituting the lowest order %,=—17,+O(a,8) from the first 
equation into the ὥ,., term, we have Bousinesq’s equations (13.92) in 
normalized form. 

The Korteweg-deVries equation is derived from any of these systems 
by specializing to a wave moving to the right. To lowest order, neglecting 
the terms of order a and β, such a solution of (13.101) has 


w=y, 7,+7,=0. 
We look for a solution, corrected to first order in a and β, in the form 
w=n+ad + BB+ O(a’*+ B’), 
where A and B are functions of ἡ and its x derivatives. Then equations 


(13.101) become | 


ἡ, +7, +a(A, +277, ) + 6(B,— Fens] + O(a? + B7) = 0, 


l 
n, +, + 0(A,+7,) + BB, yew + O(a’+ B?) τύ. 


Since n,= -- ἡ, + O(a,f), all 1 derivatives in the first order terms may be 
replaced by minus the x derivatives. Then the two equations are consistent 
if 
1 5 l 

A=- = B= ~=n,,. 

| 4" 3 lex 
Hence we have 

w=7)— 1 og? + = βη,. + O(a’+ B’), 


4 
(13.102) 


η, +, + > oon, + ἔβη, + O(a’+ £7) =0. 


The second equation is the normalized form of the Korteweg-deVries 
equation (13.99). The first is similar to a Riemann invariant. 
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13.12 Solitary and Cnoidal Waves 


The Korteweg-deVries equation was derived in 1895. Earlier, Stokes 
(1847) had found approximate expressions for nonlinear periodic waves in 
the case of infinitely deep water or water of moderate depth, while 
Boussinesg (1871) and Rayleigh (1876) had found approximate expressions 
for the solitary wave, a wave consisting of a single hump of constant shape 
and constant speed, which was first observed experimentally by Scott 
Russell (1844). The solitary wave is most easily obtained as a special 
solution of the equation obtained by Korteweg and deVries, and these 
authors went on to show that periodic solutions were also possible. 
Although Stokes’ approximation does not apply when £ becomes as small 
as a, his solutions overlap with those of Korteweg-deVries when ax β. We 
consider solutions of the Korteweg-deVries equations first, because of their 
simplicity, even though they came much later. 

Both the solitary waves and the periodic waves described by (13.99) 
are found as solutions of constant shape moving with constant velocity. 
They therefore may be described in the form 


From (13.99) we then have 


aha" + κ΄ -ἰ 7 τὸ 


0 


This integrates immediately to 


1 2»» 3 2. U - ΞΞ 
fos +75 [Ξ Is +6 
with a further integration to 
τα τ 1 - )+4G+ =o (13.103) 


after multiplication by ζ΄; Ο and H are constants of integration. 
In the special case when ¢ and its derivatives tend to zero at o, 
G = H=0. Then the equation may be written 


d 2 
38 Se) τοῦτα τ), 


U 


Co 


(13.104) 


ig. 
=I+>. 
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Fig. 13.4. Solitary wave. 


It is clear qualitatively that ¢ increases from {=0 at X= oo, rises to a 
maximum {=a, and then returns symmetrically to =O at X= — οὐ (see 
Fig. 13.4). This is the solitary wave. The range in ἢ 15 ηρΞ hoa, so a plays 
the same role as in the last section. The velocity of the solitary wave 
depends on the amplitude according to 


U=a(i+5 7°] (13.105) 


The actual solution of (13.104) is 


1/2 

3a 
=asech*|——]| X; 13.106 
—— (zi) aren 

hence 

, 1/2 

ἢ = No sech? re (x— Ut) }. (13.107) 
4h? 


This is a solution of the Korteweg-deVries equation for all No/ ho; however, 
the equation is derived under the assumption No/Mo<|1, and in fact solitary 
waves are found to peak at a maximum height; experimentally it is 
No/ hyp~0.7, and theoretically n)/hy~0.78. 

In the general case, G,H +0, 


g*=C(f) 


where Ο(ζ) is a cubic with simple zeros. For bounded solutions, the zeros 
must be real, and bounded solutions must oscillate periodically between 
two of the zeros of ©. We may, without loss of generality, choose the two 
zeros concerned to be § =0 (which fixes hj) and {=a (which is double the 
amplitude). The third zero must then be negative; if we take it to be a— B, 
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it will turn out that B=hj//*, where / is the horizontal length scale, and 


the parameters a and B have the roles appropriate to the discussion of the 
last section. With these choices, the equation for {(X) is 


τα BY asa t)(f-at+B),  O<a<p, (13.108) 


3 
U 2α--Εβ 
ΞΕ ee 13.109 
τ io eer ae ( ) 
The wave length is 
τῆς Γ΄ d 
a ὃ (13.110) 


“VB J Ὑὸ ἐπε ρῆν 


In this nonlinear problem U is the phase speed, since any point on the 
profile moves with this speed. The solution could have been written in the 
form 


§(X) =f(8) =f( «x — ot ) 
and chosen so that f has period 27 in θ. Then 


2 = 
o= Ue=(1+ = =) (13.111) 


and 
_ 27 
Ἧ 
From (13.110), 8 is a function of A and a. Hence the dispersion relation 
(13.111) takes the form 


w=w(k,a). (13.112) 


This is the first instance of a most important result for nonlinear dispersive 
waves: The dispersion relation between frequency w and wave number κ 
involves the amplitude. 

For waves of infinitesimal amplitude, a—0, (13.108) and (13.109) 
reduce to 


d 2 
sh) τί - 08, 
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The solution is 


-α ἃ μος 36 * 
ὅπ 5 + 5 cos 3B he’ 


If we introduce k= V3B / ho, we have 
—~%,a aa 
(= 5 + 5 cos (Kx — wt), 


Ι (13.113) 
ὠξικῦΞξ cot = 5K} 


in agreement with the linear theory [see (13.94) and (13.95)]. In this limit, 
the amplitude drops out of the dispersion relation. 

In the other limit of a—£, the wavelength A, given by (13.110), tends 
to infinity and we have the solitary wave. 

The solution (13.108) may be expressed in standard form in Jacobian 
elliptic functions. It is 


.4.., 
[παρ 75) X, (13.114) 
0 


where the modulus m of the elliptic function is 


1/2 
af & 
m=(4] ; (13.115) 
and 
Aho 
A= K(m), (13.116) 


V3B 


where K(m) is the complete integral of the first kind. In view of (13.114), 
Korteweg and deVries named these solutions cnoidal waves. The form of 
(13.114) confirms B as a measure of hj//*. The modulus m measures the 
relative importance of nonlinearity to dispersion. In the linear limit m0, 
cn—>cos €; in the solitary wave limit m—1, cn £—seché. 

Again it should be noted that cnoidal waves are a solution of the 
Korteweg-deVries equation for all a and β restricted only by 0<a< £8, 
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but the equations are only valid when a and β are small. Like the solitary 
waves, they really attain a maximum height at which crests peak. The 
theoretical analysis is not complete. As a guide to supplement the result for 
solitary waves, it may be noted that numerical calculations of periodic 
waves on deep water (Michell, 1893) peak when a/A=0.142. Strictly 
speaking this is outside the range of the Korteweg-deVries theory, but it 
perhaps gives some idea of the values involved. We should interpret it 
according to aw 4aho, A~(27h,)/(38)'” (the linear values), as aB'/*~1. 
The critical value for solitary waves ax B~0.7-0.8 fits in with this rough 
estimate. It was pointed out at the end of Section 13.6 that solutions of the 
linearized theory a/B<1 cease to be valid near the front of ἃ wavetrain 
because the effective a/B increases with ¢ as too. Since the above 
periodic solutions tend to solitary waves as a/f increases to 1, one might 
expect that the end result is a series of solitary waves. This and other 
problems can now be studied analytically as a consequence of the remark- 
able investigations of Kruskal, Greene, Gardner, Miura, and co-workers. 
They developed an ingenious way of finding fairly general solutions of the 
Korteweg-deVries equation, and these include the main features of the 
resolution of an arbitrary finite initial distribution into a series of solitary 
waves. The explicit solution for the collision of two or more solitary waves 
can also be given. This work ties in with related results for other equations 
and other physical situations, so it is described separately (Chapter 17). 
We now continue the discussion of periodic waves. 


13.13 Stokes Waves 


Stokes’ investigations in water waves, with the first publication in 
1847, are the starting point for the nonlinear theory of dispersive waves. It 
was in this work, and far ahead of other developments, that he found the 
crucial results that, first, periodic wavetrains are possible in nonlinear 
systems and, second, that the dispersion relation involves the amplitude. 
The dependence on amplitude produces important qualitative changes in 
the behavior and introduces new phenomena, not merely corrections to 
some of the numbers. 

It is easiest to present Stokes’ approach on the Korteweg-deVries 
equation and then quote the results without actually doing the more 
extensive details required for the full equations for arbitrary depth. Stokes’ 
aim was to find the next approximation to the linear periodic wavetrain. 
For the Korteweg-deVries equation this corresponds to an expansion in 
powers of a with a<f. It could be obtained from the exact solution 
(13.114), but it is both simpler and more instructive to proceed directly on 
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the equation. We look for a solution of (13.99) in the expansion 

. = {=f (0) τ εἶζ,(θ) +e (8) +---, 6=«x—wt, (13.117) 
where ε is the small parameter a/hy (proportional to a). We then obtain 


the hierarchy 


(w— Cok) $5 — γκ ζ΄ = 0, 


att 3 
(ω-- cok) S5— 1°65" = F con $i, 


(w— cok) $3 —- eG = 5 οκ(ζι 2)" 


and so on. A solution of the first is 
f,=cos8, W=Wo(K), 
where w,(«) is the linear dispersion function 
Wo (Kk) = Cok — yK?. 


If these values are adopted, the right hand side of the second equation for 
§, 15 proportional to sin26 and a solution §,acos26 can be found. Then in 
the third equation, the right hand side can be expressed as a linear 
combination of sin@ and sin3@. The term in sin3@ can be accommodated 
by a solution £,«cos36, but the sin@ term resonates with the operator on 
the left, since on substituting §,<cos@ the left hand side is zero. There is a 
solution with §, @sin@ but this “secular term” is unbounded in θ. Stokes 
argued that the periodic solution can be found if w is also expanded in a 
power series 


W=W)(K) + ew, (Kk) +€70,(K) Ἐ-᾿ 


Since trouble arises only at the third order, we may take w, =O in advance. 
The hierarchy now reads Ps 


(Wo - Cok) Si — γκ ζ΄ =0 ) 
vit 3 
(ωρ — Cok ) $3 — yg) = 5. CoKS SF 


(ωρ — Cok ) $3 — yoo = 5 cok (S452) — nS. 
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If we again take 


_ 3 
{, =cos8, Wy = Cok — YK 9 
we find 
Co 


= cos 26. 
S 8yK2 


The right hand side of the equation for ¢, may be written 


Co 


ae (sin6 +3sin36) +w,sin@. | 
We now choose 
365 
2. 32K 


and the resonant term is eliminated. The final result 15 


2 3 

Je ceoses: ἐπ 5 0052 Ἐ 510: : 0530 τ, (13.118) 
ho 4x*ho 64K "ho 

τὸς ς Ne 9ε’ 

Cok gone 16x7he Ν paren 


The second equation shows the crucial dependence of the dispersion 
relation on the amplitude. 

It should be noted that the expansion parameter is really εκ Ἐξ, and 
this corresponds to a/ 8. In comparing the results with the expansion of 
(13.114) for small m=a/ β, the different choices of the origin for ἡ, and 
hence different choices of hy, must be included. The use of different origins 
is unavoidable; for the solitary wave limit it is convenient to have the 
origin at the troughs, whereas for the linear limit it is convenient to have it 
at the mean level. 

It should also be noted that (13.118) could be motivated as the 
Fourier series of the periodic wavetrain and this general form assumed 
from the start. 


Arbitrary Depth. 


In the general case the aim is to find small amplitude expansions to 
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periodic solutions of 


Pix + Py =9, —hy<y<n, 
P, =9, y=—hy, 


η, + ON, — Y, =9, 
Ι i y=. 
G+ 5+ 5G + 9n=0, 


We have n= (0), p= Q(6,y), θΞ kx —wt, where ἢ and φ are periodic in 0. 
We may choose the origin y=0 so that ἡ has zero mean value and the 
expansion for ἢ is 


n= acos0+ p,a7cos20+---, (13.120) 


where μ 15 a coefficient to be found. With the choice of mean value ἢ =0, 
it is clear from the second condition on y = ἡ that the mean value G, cannot 
be zero and φ must at least have a term ὦ in its expansion. One may also 
interpret this as a consequence of absorbing a constant of integration when 
the expression for (p—p,)/p was first derived. Alternatively, one could 
keep a nonzero mean value in ἡ. Since only derivatives of m occur in 
physical quantities, terms proportional to ¢ or to x are acceptable in φ. A 
term proportional to x represents a nonzero mean in the horizontal 
velocity. Here it can be normalized to zero. Later in discussing modulated 
wavetrains we shall need both ἢ #0 and G, +0, since they both vary in the 
modulations and cannot be normalized out. The extensions of the formula 
are straightforward, so we take 7 =0, 9, =0 here. The expansion for m may 
then be written 


p= v,a*t + v,acoshk( y + hy)sin8 + »,a?cosh2«(y+h)sin20+---. 


(13.121) 


Again to avoid secular terms at the third order, w must also be expanded 
as 


W=Wo(K) Ἑ αζω,(κ) Ἐ - το (13.122) 


When all this is carried through it is found that 


2 
a2 sie 9 tanh* xh, — 10 tanh?xh,+9 Cartes, (13.123) 
gx tanh kh, 8 tanh* κῆρ ) 
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and 


l 3 
by = }xcothaihy{ 1+ al 
0 


Wo 


sinh* κῆρ 


ΒΚ = Wo 
Qsinh2kh,’ |! Ksinhkhy’ 


Vo= 


a2 
8 


The dispersion relation (13.123) is the main result. For κ᾿ 1 we recover 
(13.119), and for «7hZ>>1 we have Stokes’ original result for deep water: 


w?= gk(1+K%7a7+---). (13.124) 


The details of the algebra in these derivations become extensive in the 
arbitrary depth case. It can be offset to some extent by substituting the 
Fourier series expansions in the variational principle (13.16)-(13.17) and 
obtaining the coefficients from the variational equations (see Whitham, 
1967). 

The Stokes expressions are limited to small amplitude and cannot 
exhibit the wavetrain of greatest height at which the crests are observed to 
become sharp. However, Stokes contributed a separate argument to show 
that if sharp crests are attained in a steady profile wave, the angle there 
must be 120°. The argument depends crucially on the restriction to a wave 
of constant shape propagating at a constant speed. In these circumstances 
the flow is steady in a frame moving with the crest. The solutions of 
Laplace’s equation are then analytic functions of z=x-+iy and a rea- 
sonably general singular function (taking the origin at the crest) is 


gotipaz”. 


In local polar coordinates (γ, ὦ), with ὦ measured from the downward 
vertical 


ἣν ia 


x=rsina, y=—Trcosa, z=-—ire ™, 
and we have 


φΞ Cr™ sinmo. 


If n = —rcos@, locally on the free surface, the pressure condition in steady 
flow requires 


l 
9; + = 95+ gn =constant. 
r 
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With the above expressions for φ and ἡ, we require 


C’*m?r’"~* — gr cosa. = constant. 


The powers of r show that 
m=2 
x 


The second boundary condition, which defines the free surface, is a / 0a 
=0 on ®=4,. Hence 


ae ἢ aes ene 


The total angle 2, is 120°. 

Michell (1893) calculated the periodic wave of greatest height numeri- 
cally for water of infinite depth and found the limiting height to be 
attained at a/A=0.142, as previously noted. 

The result on the angle at the crest is not necessarily true in unsteady 
problems. For standing waves a theoretical argument proposed by Penney 
and Price (1952), but less conclusive than in Stokes’ case, suggests 90°; this 
value was confirmed experimentally by Taylor (1953). 


13.14 Breaking and Peaking 


It was remarked earlier that the nonlinear shallow water equations 
which neglect dispersion altogether lead to breaking of the typical hyper- 
bolic kind, with the development of a vertical slope and a multivalued 
profile. It seems clear that the third derivative term in the Korteweg- 
deVries equation will prevent this ever happening in its solutions. But in 
both cases, the long wave assumption under which the equations were 
derived is no longer valid. Since some waves appear to break in this way, if 
the depth is small enough, one concludes that some dispersion is necessary 
but the Korteweg-deVries term is too strong for short wavelengths. This is 
not surpising in view of the fact that the κ΄ term makes (13.94) a bad 
approximation to the full dispersion relation when (xh,)* becomes large. 

On the other hand the dispersion included in the Korteweg-deVries 
equation does allow the solitary and periodic waves which are not found in 
the shallow water theory. For these solutions, however, the Korteweg- 
deVries equation cannot describe the observed symmetrical “peaking” of 
the crests with a finite angle there. Again it might be argued that this is a 
small scale phenomenon where the small wavelength components become 
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important and the assumption (xh)*«1 is no longer adequate. Doubtless 
this is combined with additional nonlinear effects. 

Stokes waves include the full dispersive effects of wo = gx tanhkho, but 
being limited to small amplitude, they do not show the solitary waves nor 
do they show the peaking. 

Although both breaking and peaking, as well as criteria for the 
occurrence of each, are without doubt contained in the equations of the 
exact potential theory, it is intriguing to know what kind of simpler 
mathematical equation could include all these phenomena. In light of the 
foregoing comments it seems necessary to include at least the “breaking 
operator” of shallow water theory with the full dispersion relation of linear 
theory. Now the breaking of shallow water theory is given by 


300 
T+ Con. + πὶ M™Mx Ξ0, (13.125) 
2ho 


as noted in deriving (13.99). On the other hand, the linear equation 
corresponding to an arbitrary linear dispersion relation 


ω — 
- =c(k) 
was given in (11.12) as 
n+ f K(x—£) g(t) dé=0, (13.126) 
where 
K(x)= - [ἢ c(x)e™ dk. (13.127) 
2π. ὦ 
The two can be combined into 
360 οο 
n+ 5 mm,.+ f{  K(x—£)n(61) dé=0. (13.128) 
0 60 


The Korteweg-deVries equation takes 


c(K) = Cy— yk’, K(x) = ερδί(χ) + γδ΄ (). 


We now propose the improved description 


. 1/2 
Ci) = (© tanh who] (13.129) 
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1 οο g 1/2 
κι) ς [ [3 tanh who] οἰ dk, (13.130) 


This combines full linear dispersion with long wave nonlinearity. In terms 
of the parameters a and £ used in Section 13.11, we are retaining terms of 
all orders in βὶ and the nonlinear term proportional to a, while neglecting 
all higher powers of a and all product terms. We could in fact keep terms 
of all order in a by taking the nonlinear operator from (13.97) and using 
the combined equation 


ἡ, +{3Va(ho+n) ~2Vehy bn, + [Κα --ϑηφξιγάξ-το; (13.131) 


however, the gain is probably not worth the extra complication. It is easy 
to show by standard methods that the normalized function K, with 
g=1,h)=1 has the properties 


K,(x)= K,(— x), 


K,(x)~(2ax) “ as x—>0, 


| i173 
K(x)~(57°x] e~™/2 ἃς χ-»οο, 


ie K,(x) dx =1. 


The dispersive term is milder now, since K,(x) does not involve 
6 functions. Indeed the behavior of c(«) for large « controls the behavior of 
K(x) for small x; the change in the high wave number behavior replaces 
the 5”(x) by x~'/* as x0. The analysis of such nonlinear integral 
equations is quite difficult, and the one with K= K, has not yet been 
analyzed completely. But the question was raised in the general terms of 
what kind of mathematical equation can describe waves with both peaking 
and breaking. We can show that (13.128) can do this for some simpler 
choices of K(x) and infer that the same is true for K= K,. 

Steady profile solutions of (13.128) are obtained as usual by investigat- 
ing solutions with ἢ Ξ η(Χ), X =x — Ut. In normalized variables equivalent 
to taking g=h)=1, we then have 


(U-$a)n'(x)= [κια-)πο)φ, (13.132) 
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where y was set equal to €— Ut in the integral. The possibility of peaking 
arises because the slope η΄ may be discontinuous at 7=2U/3, and this 
would in fact give the relation between velocity and amplitude for the 
wave of greatest height. The equation can be integrated to 


3 CO 
A+Un-— a= ( K(X—y)n(y) dy. (13.133) 


Stokes waves for small amplitude can be found starting from the linear 
solution, and it seems reasonable to assume that in fact a critical height is 
reached when 7n=2U/3. If K(X) behaves like |X|? as X-—0 and 7(X) 
behaves like 2U/3—|X |’, a local argument in (13.132) suggests that 


2qg—l=ptq; (13.134) 


hence g=p+1. According to this, the crest would be cusped with 
n~3U-—|X|'/* for K=K,; a delicate result like Stokes’ 120° angle is too 
much to hope for in such a simplified model. 

Although this is as far as the case Καὶ K, has been taken, one can go 
further with an approximate kernel. A trick often used in such integral 
equations 15 to approximate the kernel by 


τρία! 
3 


K(x) = pe 


or by a series of these exponentials. The kernel K,(x) is the Green’s 
function for the operator d*/dx*—v*; hence the integral is eliminated in 
(13.133) when this operator is applied to both sides. We cannot match the 
singularity of K,(x) as x0, but we can take v=7 /2 to be reasonably 
close to the behavior as x—>oo and then choose μ so that 


0 2 
i) Ko(x) dx=— =1; 


that is, w=7/4, v=7/2. Since 


d°K,(x) 


ἐχ3 — ν᾽ Κρ(χ) = —»76(x), 


(13.133) becomes 
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This may be integrated once to the form 


4. Ae 
[ [,-- =n) n’? = quartic in 7. 


Periodic solutions correspond to oscillations of ἡ between two simple zeros 
of the quartic. Solitary waves are found in the limiting case when two zeros 
coalesce at »=0; then ἢ rises from this double zero (corresponding to 
X= +00) to a simple zero n= 7, say, and back to n=0 at X= —oo. All 
this is true provided 7=2U/3 is outside the range. It can be shown that 
the crests become peaked with a finite angle when ἡ =2U/3 just coincides 
with the upper zero. All the details will not be given. We merely note that 
the solitary wave of greatest height has 


| 


Ae alae U= 
(These are in units of length hy and velocity cy). The values obtained by 
McCowan (1894) in an approximate treatment of the solitary wave prob- 
lem are ηρΞΞ 0.78, U=1.249. One might say the comparison is reasonable. 
The crest has a finite angle of 110°. The finite angle agrees with (13.134), 
since p=0, g=1 for the kernel Ky; the closeness to Stokes’ 120° is surely 
fortuitous. However seriously one takes these numbers, it does show that 
an equation like (13.128) can describe periodic and solitary waves with the 
desired peaking. 

Turning now to the other kind of breaking, we note that Seliger (1968) 
was able to show by a rather ingenious argument that for kernels like 
K(x), a sufficiently asymmetric hump would break in the typical hyper- 
bolic manner. He required K(0) to be finite, however (as well as monotonic 
decreasing to zero as x->0o), and could not extend the argument to K,. 
PHUnY.S the argument is to consider 


m,(t)=minimum 7, at x= Χ (1), 


m,(t) =maximum 7, at x= Χ (1), 


(see Fig. 13.5), where m,<0O and m,>0. Then, on differentiating the 
normalized form of (13.128) and setting x = X,(1), we have 


dm, 


ae S m+ [  Κ(ξ)η,,.(Χ, EN dE=0, [π|,2. 
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mo my 


Xo (t) x, (t) 


Fig. 13.5. Notation in discussion of breaking wave. 


The integrals can be estimated in terms of m, and m, using the appropriate 
mean value theorem and we have 


dm 3 
<< - Smt + (m,—m,)K(0), 
dm 2 


From the sum of these, 
d ORO Gi ne) ae 
("πὶ Ὁ ma) < Cm —m) ( )+5(m m,) 1712. 


hence if (m,+m,)< —4K(0)/3 initially, it remains so and 


m +m <— + K(0) (13.135) 
for all t. Then 
a < _ 3.012. γι. K(0)— 4 KX) 
dt pa 3 
3 2 ee 
= 2 [κι rt $K(O)| - 2K). (13.136) 


The right hand side of this is negative, and in view of the m? term it is easy 
to show that m,—>— oo in a finite time; the details can be seen as follows. 
Let M=—3m,—K(0), then M=M,>0 initially [from (13.135) and m, 
> 0]; moreover, 


dM 2 
—— > 
dt A 
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from (13.136). Therefore 
M 
[Ὡς Si τρεῖς Hee Oy 


M-—c before ¢ reaches 1/M,. The conclusion is that if (13.135) holds 
initially, that is, if the hump is asymmetric enough, it continually becomes 
more asymmetric and breaks with m,—>— oo in a time less than 


ee et 
My —{(3/2)m,(0) + K(0)} 


Again, whatever the validity of the model may be, we have shown that 
equations of the form (13.128) can indeed describe symmetric waves that 
propagate unchanged in shape and peak at a critical height, as well as 
asymmetric waves that break. 


13.15 A Model for the Structure of Bores 


In those cases where breaking rather than peaking occurs, the result- 
ing bore takes two distinct forms. These are observed in.tidal bores and 
can be produced experimentally by an analog of the shock tube used in gas 
dynamics. In the experiments, first documented by Favre (1935), a gate 
separating water at different levels is pulled up suddenly and bores of 
various strengths are produced by varying the levels. The weaker bores 
have a smooth but oscillatory structure as shown in Fig. 13.6, whereas the 
stronger ones have a rapid turbulent change with no coherent oscillation. 
In both cases the end states satisfy the jump conditions (13.81). The 
change in type seems to occur sharply at a depth ratio h,/h,~1.28, 


corresponding to a Froude number U/ V gh, ~1.21. A general discussion 
of the overall balance of mass, momentum, and energy, allowing radiation 
of energy under the wavetrain, is given by Benjamin and Lighthill (1954). 
The actual structure is obviously complex, but again one might ask what 
kind of description would embody both forms. The Korteweg-deVries 
equation is the natural starting place, but it has no solutions like Fig. 13.6 
propagating unchanged in shape. Since dissipation is involved, it is natural 
to add a second derivative term and consider 


3 ἢ Ι 
m+ ef +5 i ne + FE CohOMxxx — μη,, = 0. (13.137) 
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This may not be a very close model to the frictional effects in water waves, 
but it is of interest in any case, since the Korteweg-deVries equation is a 
canonical one for the general study of dispersive waves. 

Steady progressing solutions are found from 


n=h s(x), X=x— Ut, 


and the ordinary differential equation for § may be integrated to 


din ἕω, 3.2. [Ὁ ,)ὲ- 
Ghibex— χε αἱ (2 i}s 0, (13.138) 


taking {0 as X00. A normalized form is 


Zep— MZ, + z*—z=0, (13.139) 
where 
1/2 X 3 
=16(F-1 — , = 
(13.140) 
1/2 
re, m=(=27 a 
In the phase plane with w=z,, we have 
a a dz =w. (13.141) 


dé dé 
Solutions with end states z=0, w=0, and z=1, w=0 are possible. These 


are singular points in the (z,w) plane. The solution curve 


dw _ mw—z*+z 
dz Ww 


must go from one to the other. In the neighborhood of (0,0) we have 
W~Ogz, 2xe%, a9= 5 (m— Vm? +4 ); 


this gives the exponential decay to zero as oo. In the neighborhood of 
(1,0), we have 


w~o,(z-— 1), (z—l)xe®, σι Ξ ΟΣ Vm?—4 ). 
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-ἰο "Ὁ O 5 4 
Fig. 13.6. Model bore structure. The solution of (13.139) for m=}. 


The approach to z=1 at ξ-- -- οὐ is oscillatory or not accordingly as m<2 
or m>2. Details may be pursued further, but we already see the two types 
of structure. For a fixed change of level, that is, for specified F—1, small 
enough damping allows an oscillatory solution, but large damping 
suppresses the oscillation. For fixed p, the criterion on F obtained from 
(13.140) would appear to be the wrong way around for water waves. 
However, in that case » would be interpreted as an eddy viscosity depen- 
dent on the mean flow velocity. A simple estimate of the form of depen- 
dence 15 4 = bu,h,, where u,,h, refer to conditions behind the bore and ὁ is 
a numerical factor. Then from the bore conditions 


4 
hy 3 OF D. 


The criterion m<2 for an oscillatory solution becomes (Ε-- 1)<3/(8b’). 
Favre’s critical value of F=1.2 would require b= 1.4, which is perhaps ten 
times greater than would be expected for an eddy viscosity. But we are 
making here only a crude model of the actual physical situation. The 
overall qualitative effect of dissipation seems to be mirrored correctly. 


CHAPTER 14 


Nonlinear Dispersion and the Variational Method 


The nonlinear effects found in the study of water waves are typical of 
dispersive systems in general. Periodic wavetrains, similar to those of 
Stokes and Korteweg-deVries, are found in most systems and these are the 
basic solutions corresponding to the elementary solutions ae“*~‘' of 
linear theory. In nonlinear theory, the solutions are no longer sinusoidal, 
but the existence of periodic solutions in θΞ Κα-- ὧἱ can be shown explic- 
itly in the simpler cases and inferred from the Stokes expansion in others. 
The main nonlinear effect is not the difference in functional form, it is the 
appearance of amplitude dependence in the dispersion relation. This leads 
to new qualitative behavior, not merely to the correction of linear formu- 
las. Superposition of solutions is not available to generate more general 
wavetrains, but modulation theory can be studied directly. The theory can 
be developed in general using the variational approach of Section 11.7. The 
formulation will be studied in detail in this chapter and the justification as 
a formal perturbation method will be given to complete the earlier discus- 
sion. Detailed applications of the theory are then given in Chapters 15 and 
16. 

Another specific consequence of nonlinearity is the existence of sol- 
itary waves. Waves with these profiles would disperse in the linear theory, 
but the nonlinearity counterbalances the dispersion to produce waves of 
permanent shape. Solitary waves are found, in the first instance, as limiting 
cases of the periodic wavetrains, but recent work on their interactions and 
their production from arbitrary initial data has shown that their special 
structure is of separate importance. We shall return to these topics in 
Chapter 17. 

For waves of moderately small amplitude in what might be called 
“near-linear theory,” further results may be obtained by perturbation 
methods based on small amplitude expansions. In particular we may return 
to the Fourier analysis description and study the small nonlinear interac- 
tions of the Fourier components. The interactions transfer energy between 
different components and, through product terms in the equations, 
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generate new components from existing ones. These interactions can be 
followed effectively when only a few components are involved. We shall 
include typical results as appropriate, but the main emphasis is on methods 
that extend to the fully nonlinear case. From a Fourier analysis viewpoint, 
the nonlinear wavetrains and solitary waves are already quite complicated 
distributions of Fourier components with involved interactions maintain- 
ing a balance. The developments emphasized here build directly on these 
special structures without attempting to disentangle them into their com- 
ponents. However, in the near-linear case there are interesting and inform- 
ative relations between the two points of view. 


14.1 A Nonlinear Klein-Gordon Equation 


It is useful to have a simple example to motivate and illustrate the 
Steps in the development of the general theory. For this purpose a non- 
linear version of the Klein-Gordon equation is particularly useful and is 
even simpler than the Korteweg-deVries equation, which would be the 
other obvious choice. We take the equation 


Pi — Prax t V'(y) ΞΟ, (14.1) 


where Κ΄ (φ) is some reasonable nonlinear function of m which is chosen 
as the derivative of a potential energy V(q) for later convenience. Equa- 
tion 14.1 is not only a useful model; it arises in a variety of physical 
situations. This is especially true of the case Κ΄ (y)=sin φ, which almost 
inevitably has become known as the Sine-Gordon equation! An account of 
the physical problems in which this form arises is given by Barone et al. 
(1971), following a briefer version by Scott (1970, p. 250). Its first 
appearance is not in wave problems at all, but in the study of the geometry 
of surfaces with Gaussian curvature K=—1. In fact some of the trans- 
formation methods developed there have been remarkably valuable in 
finding solutions for interacting solitary waves, as will be discussed in 
Chapter 17. More recent problems listed by the same authors include: 


1. Josephson junction transmission lines, where sin is the Joseph- 
son current across an insulator between two superconductors, the voltage 
being proportional to φ,. 

2. Dislocations in crystals, where the occurrence of sin ῳ is due to 
the periodic structure of rows of atoms. 

3. The propagation in ferromagnetic materials of waves carrying 
rotations of the direction of magnetization. 
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4. Laser pulses in two state media, where the variables can also be 
described in terms of a rotating vector. 


Scott further describes his construction of a mechanical model with 
rigid pendula attached at close intervals along a stretched wire. Torsional 
waves propagating down the wire obey the wave equation and the pendula 
supply a restoring force proportional to sin g, where φ is the angular 
displacement. Scott was able to generate the waves corresponding to many 
of the solutions of the Sine-Gordon equation. 

Equation 14.1 has also been discussed by Schiff (1951), with a cubic 
nonlinearity, and by Perring and Skyrme (1962), with the sin @ term, in 
tentative investigations of elementary particles. 

In this chapter the analysis applies for general V(q) with appropriate 
properties. The choice 


1 
V(p) = 59 + σφ΄ 


is both the simplest to bear in mind and the correct expansion in the 
near-linear theory for even functions V(q). The small amplitude expansion 
of the Sine-Gordon equation has o= — 1/24. 

We first check the existence of periodic wavetrains. They are obtained 
as usual by taking 


p=¥(6), θ-- Κχ-- οἴ. (14.2) 
On substitution we have 

(w?— k?) Vo + V'(¥) =0 (14.3) 
and the immediate integral 


5 (Ww k)Y+V(Y) =A, (14.4) 


We use A for the constant of integration, although earlier it was used to 
denote the complex amplitude in linear problems. Only the real amplitude 
a will appear in the same context, so there should be no confusion. Here A 
is still an amplitude parameter; in the linear case, V(V)= 47, it is related 
to the actual amplitude a by A = 4a. 

The solution of (14.4) may be written 


iis aay dv | 
ΕΙΣ K) | ΤΈΣ τὰν (14.5) 
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and in the special cases where V(WV) is either a cubic, a quartic, or 
trignometric, Ψ (@) can be expressed in terms of standard elliptic func- 
tions. Periodic solutions are obtained when WY oscillates between two 
simple zeros of A — V(W). At the zeros Y, =0, and the solution curve has a 
“maximum or a minimum; these points occur at finite values of 0, since 
(14.5) is convergent when the zeros are simple. If the zeros are denoted by 
W, and W,, we shall take the case 


Vi<¥<¥, A-V(¥)>0,  w?-k?>0 (14.6) 


for the present. The period in @ can be normalized to 27 (which is 
convenient in the linear limit) and we then have 


ms a dv 
2n={ 5 (u?—k)| reo (14.7) 


where ¢ denotes the integral over a complete oscillation of Ψ from Y, up 
to W, and back. The sign of the square root has to be changed 
appropriately in the two parts of the cycle. The integral may also be 
interpreted as a loop integral around a cut from V, to V, in the complex Ψ 
plane. 

In the linear case V(V)= 4, the periodic solution is 


2 
Vv =acos8, A= (14.8) 


2 » 
the amplitude a cancels out in (14.7), which becomes simply the linear 
dispersion relation 


w?—k?=1, (14.9) 


In the nonlinear case, the amplitude parameter A does not drop out of 
(14.7) and we have the typical dependence of the dispersion relation on 
amplitude. 

If the amplitude is small and V has the expansion 


V= Set σφ᾽... (14.10) 
we have 
Y= acosd+ 0a? cos30+ --", (14.11) 


w?—k?=14+30a7+---, (14.12) 
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me ae ee 14.13 
ya + goa’ + . (14.13) 


Α 
These are the Stokes expansions, which may be obtained either by direct 
substitution in (14.3)-(14.4) or by expansion of the exact expressions (14.5) 
and (14.7) obtained above. It should be noted that a is the amplitude of the 
first term in (14.11); it differs slightly from the exact amplitude 


ἜΝ ΗΝ 


8 


14.2 A First Look at Modulations 


In the basic case of one dimensional waves in a uniform medium, we 
saw in Chapter 11 that modulations on a Jinear wavetrain can be described 
by the equations 


aah ae (14.14) 
α΄ 8. (“' 42) -- 
a + 2 (Ca?) =0, (14.15) 


where w=w,(k) is given by the linear dispersion relation and C,)=w,(k) 1s 
the linear group velocity. (A subscript zero is added now to indicate the 
linear values.) The crucial qualitative change of nonlinearity is the depen- 
dence of w on a, which couples (14.14) to (14.15). For moderately small 
amplitudes, w may be expressed in Stokes fashion as 


W=W)(k)+0,(k)a7+---, (14.16) 
and (14.14) becomes 


9k 4 ry eae) 3Κ 8a* _ 
ay Ὁ (Wok) + 9(k)a?} το Ἐωγ(Κ) SR =0. (14.17) 


The important coupling term is w,(k)da7/0x because it introduces a term 
in the derivative of a; it leads to a correction O(a) to the characteristic 
velocities. The other new term merely corrects the coefficient of the 
existing term in 0k/dx and consequently contributes only at the O(a’) 
level. Similarly, for small amplitudes, the nonlinear corrections to (14.15) 
would be various terms of order a* which would provide corrections of 
relative order a’ to the coefficients of the existing terms in da*/dx and 
0k / dx. Therefore in the first assessment of nonlinear effects we can 
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proceed very simply, using only the new dispersion relation, and take 


0k, 0k daz 
. ae + wo(k) Ay + w,(k) τοῖς Ξε, (14.18) 


θα , da? » 20k _ 
ry + w(k) a τωρ (Κ)α ay Ξ0. (14.19) 
By the standard procedure of Chapter 5, the characteristic form of these 
coupled equations is found to be 


5 (k) 1/2 
1 | ®o 
~ (sgn wy ) dk + da=0 14.20) 
2 w(K) ! : 
on characteristics 
ΟΣ «ας (k) # (w,( k)eot (k) 7. (14.21) 


It may be verified that additional terms of relative order α added to 
(14.18)-(14.19) contribute terms only of order a? to (14.20)(14.21). 

This simple formulation already shows some remarkable results. In 
the case w,wg > 0, the characteristics are real and the system is hyperbolic. 
The double characteristic velocity splits under the nonlinear correction and 
we have the two velocities given by (14.21). In general, an initial distur- 
bance or modulating source will introduce disturbances on both families of 
characteristics. If the disturbance is initially finite in extent, for example a 
bulge on an otherwise uniform wavetrain, it will eventually split into two. 
This is completely different from the linear behavior where such a bulge 
may distort due to the dependence of C,(k) on k but would not split. 

A second consequence of nonlinearity in the hyperbolic case is that 
“compressive” modulations will distort and steepen in the typical hyper- 
bolic fashion discussed in Part I. This raises the question of multivalued 
solutions and breaking. 

When w,wg <0, the characteristics are imaginary and the system is 
elliptic. This leads to ill-posed problems in the wave propagation context. 
Among other things, it means that small perturbations will grow in time 
and in this sense the periodic wavetrain is unstable. The elliptic case turns 
out to be not uncommon and the modulation theory provides an interest- 
ing approach to some aspects of stability theory. 

We might note that for Stokes waves in deep water, the dispersion 
relation (13.124) gives 


1 
ὡρ(Κ) - 4. ἈΠ, ὠγ(1)- 4. (14.22) 
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so this is an unstable case with wow, <0. This is surprising in view of the 
long history of the problem and the sometimes controversial arguments 
about the existence of periodic solutions; throughout these discussions the 
instability went unrecognized. For the Klein-Gordon example (14.12), we 
have 


wol=(2+1), — w(k)= Sok? +1)” (14.23) 
The sign of ww, is the same as the sign of o; the modulation equations are 
hyperbolic for o>0 and elliptic for o<0. For near-linear waves, the 
Sine-Gordon equation has σ <0, so that in all the problems governed by 
this equation the near-linear wavetrains are unstable. 


We shall return to all these questions after the formulation of the 
modulation equations has been studied in detail and extended to the fully 
nonlinear case. 


14.3 The Variational Approach to Modulation Theory 


The complete modulation equations are obtained in a particularly 
compact and significant form from the variational approach started in 
Chapter 11. We first see how to implement it for nonlinear problems using 
the Klein-Gordon equation as a typical example. General procedures then 
become apparent and we include these in the justification of the method. 

In the Klein-Gordon case the periodic wavetrain is described by 
(14.4)-(14.5) and involves the parameters w, k, and A. We need to find the 
equations satisfied by these parameters for a slowly varying wavetrain. 
Equation 14.1 is the Euler equation for the variational principle 


ὁ {{{{ξε}- Σοῖ- γῳ)άάτο, (14.24) 


as is easily verified from (11.74). The elementary solution corresponding to 
the solution y= acos(#@+7) used in linear problems is p= Ψ(θ). [A phase 
shift ἢ can be added to (14.5), but it drops out of the modulation 
equations.] We therefore calculate the Lagrangian and its average value for 
p= (0); this is done keeping w, k, and A constant. We have 


L= + (wk?) ¥3— V(¥), 
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and the average value over one period in @ is 


r= [[ξω -ἰν- v(%) | 49. (14.25) 


In principle, the function Ψ is known completely from (14.5). However, we 
can avoid the integrated form and use (14.4) instead to express L as a 
function of w,k,A. We note the successive steps 


D2 2) Se GS 
aoe} (w — k*)¥ adg—A 
= ω»- KR) [Wd A 

2π 0 θ 


= + {2(0?— κὉ) {A-VO)} dv -A. (14.26) 


The final loop integral is a well-defined function of A, in which Ψ is now 
merely a dummy variable of integration. The notation £(w,k,A) is re- 
served for the final form of L. 

When w,k,A are allowed to be slowly varying functions of x and t, we 
propose the average variational principle, 


5 { { £(w,k,A) dxdt =0, (14.27) 
as before. This is viewed as a variational principle for 0(x,t) and A(x, 0), 
with w= —6, k=8@,; the variational equations are 
5A: eC, =9, (14.28) 
Oe au ΣΟ .Σ 
50: ry a ΕΥ̓ ef, =0. (14.29) 


After the variations have been taken, we again work with w,k,A, and add 
the consistency relation 


0k , Ow _ 
apt ay =0 (14.30) 
obtained by eliminating θ. The equations and their derivation from (14.27) 
are, of course, the same as in the linear case with the minor change of 
amplitude variable from a to A. The only new ingredient in the nonlinear 
theory is the calculation of &(w,k,A). 
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Equation 14.28 is a functional relation between w,k,A, which can only 
be the dispersion relation. For the Klein-Gordon example with © given by 
(14.26), we confirm that it does indeed give the correct result (14.7). The 
system (14.28)-(14.30) is the exact nonlinear form for the modulation 
equations tentatively proposed in the approximation (14.18)-(14.19). Be- 
fore discussing the properties of these equations and their various ex- 
tensions we turn now to the question of how the variational approach may 
be justified. 


14.4 Justification of the Variational Approach 


It will be sufficient to consider in detail the case of one dimensional 
waves described by a variational principle 


5 [{{{(φ,»φ..φ) dx dt =0. (14.31) 


The cases of more dimensions, more dependent variables and nonuniform 
media can all be treated similarly. The Euler equation for (14.31) 15 
0 0 


where the L, denote the derivatives 


pa, oe. ee (14.33) 

39, dg, ὃφ 
Equation 14.32 is a second order partial differential equation for φίχ, ἢ) 
and we assume that this has periodic wavetrain solutions of the 
appropriate type. 

For problems of slow modulations a parameter ε will be introduced by 
the initial or boundary conditions (as discussed in various cases in Section 
11.8); € measures the ratio of a typical wavelength or period relative to a 
typical length or time scale of the modulation. We shall eventually suppose 
ε to be small, but we make no restriction on the magnitude of the 
amplitude, only that its variations are slow. 

The first step is to describe a modulated wavetrain precisely. If x and ἢ 
are measured on the scale of the typical wavelength and period, the slowly 
varying quantities are functions of εχ, εἰ; modulation parameters such as k 
and w should be functions of this type. Yet φ itself varies due to the 
relatively fast oscillations as well. To incorporate these requirements, φ 15 
written explicitly as a function of a phase function θ and of εχ, εἰ. Then @ is 
chosen as e ἰθίεχ, εἴ) to provide the relatively fast oscillation and to give 
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the correct dependence of k=6, and w= — θ᾽, on ex,et. We therefore take 
y= 0(0,X,T; «), (14.34) 
θ-- ε“ 'O(X,T), X =ex,T=€et. (14.35) 

We define 7 
v(X, T)= —o( X, T)=9,, k(X, T)=0, (14.36) 


as the negative frequency and wave number. (In this general discussion we 
work with vy = —w to preserve the symmetry between x and t.) The scaling 
has been arranged so that 


dp ΞΞ οΦ οΦ ὃφ = ,o® + οΦ . 

δι .,8ὃ8ὃ.ὄ  ὃὉΤ᾽ ὃ 89 ϑΧ᾽ 
the variations due to the oscillations and to the slow modulations appear 
separately. 

In vibration problems for ordinary mechanical systems, x is absent 
and the method amounts to distinguishing two time scales explicitly. It has 
become known as “two-timing,” which is a colorful and convenient name 
even when “double-crossing” x variations are also involved. The art of 
two-timing lies in the fact that although one starts and ends with the 
correct number of independent variables, the expanded form can be used 
to advantage at intermediate steps. In the present case is ultimately a 
function of x and ¢ through (14.35), but in appropriate parts of the analysis 
® is treated as a function of the three variables 6,X,7T independently. In 
the usual two-timing procedures, the extra flexibility allows the suppression | 
of secular and other terms. Its use in conjunction with variational prin- 
ciples will be different but equivalent. 

The geometrical optics (WKBJ/) type of expansion discussed in Sec- 
tion 11.8 is equivalent to choosing 


p(x, t)~e® OO FS €"A4, (ex, εἰ) (14.37) 
from the outset. The two-timing version would work with 
©(0,X,T; «)~e”® Si €"A,(X,T) (14.38) 


to the same ultimate ends. In either case, the exponential dependence on 0 
is limited to linear problems. For nonlinear problems, the counterpart 
would be to take an expansion 


᾿ Φ(θ,Χ,Τ; ε)-- δ᾽ εἰφύ (9,Χ, ΤῈ, (14.39) 
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and determine the functions ®” successively. However, in the equivalent 
variational approach we make no such initial expansion; we work with 
(14.34)1(14.36) directly and avoid much of the tedious manipulation of the 
more standard perturbation procedures. 

When (14.34) and (14.35) are substituted into the basic Euler equation 
(14.32) we have 


dL, OL, dL, 81, 


——— +e— +k— +e—— —L,=0 14.40 
oo Or ὃ ὃ "Ὁ ie 
where the arguments of the L, are given by 
L,=L,(v®,+ eD,,kO,+eP,,®). (14.41) 


The relation 9@=«~ '@(X,T) was used to obtain (14.40), but it is now 
dropped. This is the crucial step in two-timing. Equation 14.40 is now 
considered as an equation for the function Φίθ, Χ, ΤῊ) of three independent 
variables θ,Χ, Τ. The equation also involves the function @(X, 7) through 
its derivatives y=@,, kK=@,; the original relations of Θ,ν and k to the 
argument @ in ® are also dropped. It is clear that if satisfactory solutions 
for ®(0,X,T) and @(X,T) can be found, then @(«~'O,X,T) solves the 
original problem. The extra flexibility in the choice of O(X, 7) is used to 
assure satisfactory behavior of ®(0,X, 7). 

The choice of Θ(Χ, ΤῊ will appear in different ways depending on the 
particular variant of the method, but they are equivalent. Here we shall 
impose from the outset the requirement that ® and its derivatives be 
periodic in θ. [Other variants leave O(X,7) open at first, find unwanted 
secular terms proportional to @ in the general expression for ®, and 
eliminate them by the choice of Θ.] The period may be normalized to 27, 
50 we impose the condition that ® and its derivatives be 27-periodic in θ. 
To implement this condition, we note that (14.40) may be written in 
conservation form as 


9 9 0 
ag (111 ΚΙ) Φ,-- L} + ear (Ὁ91.) +e ay (Φ, 12) 50. (14.42) 


Then, on integration from §=0 to 27, the contributions of the first term 
cancel, from the periodicity requirement, and we have 


ot Lf ®,L, d+ —— τς Lf” ®,L,do=0. (14.43) 


Equations 14.40 and 14.43 are the two equations for ®(@,X,7) and 
OX, T). 
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[t is a remarkable and surprising fact that these equations for ® and © 
are just the variational equations for the variational principle 


l 2a 
of f 5 [ (νῷ, εἐὦ, ΚΦ, + εῷ,, Φ)αθαχ 1 -το. (4.44) 


Variations ὃ ᾧ lead to 


0 9 


ο Ξ 


ἔφ, ax 
in the usual way, and with the particular form of L in (14.44) this is seen to 
be (14.40). Variations δ give 


“E+ <1, =0, (14.45) 


where 
= l 20 . 
Γ-ς- ἐ(νῷ,- εὦ, κῷ,- Dy, ΦῚ dd; (14.46) 
27 0 


this is (14.43). But, most striking of all, (14.44) is just an exact form of the 
average variational principle! Not only do we justify the variational 
approach, we obtain a powerful and compact basis for the entire perturba- 
tion analysis. Strangely enough, we have made no explicit assumption so 
far that ε is small. It is implicit, however, in the choice of the functional 
form of ® and the requirement that ® be periodic in @. 

In the lowest order approximation to (14.44), we have 


8 { { L©dxdT=0, (14.47) 


_ 2π 
T= [OL r®P), KO, } a, (14.48) 
0 


T 


The variational equations are 


δφῶ; (νι κι). LO =0, (14.49) 
50: 9. 74 ὃ. fo — 9. 14.50 
, ape + axle =% (14.50) 


these are the lowest order approximations to (14.40) and (14.45), of course. 
Since X,7 derivatives of ®® do not occur in (14.49), it is effectively an 
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ordinary differential equation for ®© as a function of 6. An immediate 
first integral [the corresponding approximation to (14.42)] 15 


{ pL{ + LS? } 0 - L© =A(X,T). (14.51) 


Equations 14.49 and 14.51 are just the ordinary differential equations 
describing the uniform periodic wavetrain, but with the difference that the 
parameters »,k,A are now functions of X,T. The dependence on @ is 
exactly the same as in the periodic wavetrain; the dependence of »,k,A on 
X,T provides the modulation. The explicit separation of @ from X,T 
automatically allows integrations with respect to θ in which p,k,A are held 
fixed; integrations such as those in (14.25) and (14.26) are now seen to be 
in this sense. 

When the solution of (14.51) is combined with (14.47)-(14.48), we 
have exactly the variational approach proposed earlier. It is now justified 
as the first approximation in a formal perturbation scheme. 

In the actual use of the method there is an important question of 
technique to be explained in general terms. As it stands (14.51) can be used 
to determine both the function ®© and the dispersion relation between 
p,k,A. [See (14.5) and (14.7) for the Klein-Gordon example.] The manipu- 
lations in (14.26) show that by limited use of (14.51) in (14.48) the explicit 
determination of ®® (which is just Ψ changed to the expanded notation) 
can be avoided and the dispersion relation can be incorporated as an 
additional variational equation derivable from (14.47). This is much to be 
preferred. For then the form of the average Lagrangian is simplified and, 
more importantly, a// the equations relating the slowly varying parameters 
v,k,A are collected in the variational principle. The question is how to 
describe this procedure in general terms. The problem is how to extract 
from (14.51) enough information on the functional form of ®® and not 
use complete information on the dispersion relation. We now show how 
this may be done. 


14.5 Optimal Use of the Variational Principle 
In the linear case there is no difficulty in separating the functional 
form of ®© from the dispersion relation. We know in advance that the 


solution of (14.49) or (14.51) will take the form 


& = acos(0+7), 


where a(X,T) is the amplitude related to A(X,7T) and used instead of it. 
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The phase parameter (X,7) will drop out in forming the average 
Lagrangian (14.48) and plays no role in this lowest order approximation. 
This rather trivial information on ®© is the only information extracted 
from (14.51) and does not include the dispersion relation. When this ®© jis 
substituted in (14.48) we have the function 


2a 
L(y, k,a)= "ἢ 1(-- να 51η0θ, — kasin0,acos@ ) 40 


for the average Lagrangian. 
In the near-linear case there is also no difficulty. We may use the 
Stokes expansion 


® = acos(0+7) +a,cos(O+7,) +a,cos(O+n3)+°°° 


as the required form of ®© without including the dispersion relation. The 
relations of a5,43,...,95,13,---, to a and ἡ may be taken from (14.49) or 
(14.51), or they may be left arbitrary and also determined from the 
variational principle. For example, in the Klein-Gordon problem with 
V(¢) given by (14.10), we take 


&© = χοο5θ- α:0053θ- ἄς ο550- ---. 


(It is easy to see in advance that the odd cosine terms are sufficient.) 
Then* 


21 
pon [ [ 5 (7 -κῆφο.. oo a op do 
0 


20 
4 
= zr? k?—1)a*—- oe + (2a 500°a;) + vee 


Variation with respect to a, shows that a,= ἐσα" in agreement with (14.11). 
On resubstitution of this expression for a, in L© we have 


ΓΞ 4 (9? 1a? — = oa‘ aoa + Le. (14,52) 


Variation of a now gives the dispersion relation (14.12). 

In the fully nonlinear case it is harder to disentangle the functional 
form of ®© from the dispersion relation. However, it can be done by use 
of a Hamiltonian version of the equations. 


*A term proportional to (vy?—k*—1)a? is omitted because the subsequent equations show 
that (ν2-- k?— 1)=0(a?). 
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Hamiltonian Transformation. 


The transformation will be applied here only to the lowest order 
approximation in (14.47)-(14.51), so to ease the notation we drop the 
superscript zero on all quantities. The idea is to eliminate the quantity ®, 
in favor of 0L/d®, just as ἃ is eliminated in favor of a generalized 
momentum p=0L/0dq in ordinary mechanics. A new variable is defined 
by 

01, 


oes 4. 
I] δῷ, vL,+kL,, (14.53) 


and the Hamiltonian H(II,®; v,k) is defined by 


01, 


ake ΓΙ 


—~L=,(vL,+kL,)-L. (14.54) 


From the transformation alone we have 


0® dH 
00 = oll’ (14.55) 
and (14.49) provides 
011 0H 
-, Ξ------ .- 4.56 
0 ὁΦ ie 


These replace the second order equation (14.49) for ® by two first order 
equations for ® and II. The variational principle (14.47) may now be 
written with 


r= ("ne 9 14.57 
Ee). | »— H)d6. (14.57) 


Moreover, there is an important extension. In the original form the 
variation 5®, is tied to δῷ; hence δὃΠ is tied to the variation of ® through 
(14.53), and (14.55) is a consequence of the transformation not a varia- 
tional equation. However, we simply observe that both (14.55) and (14.56) 
follow from (14.57) if ® and IT are allowed to vary independently. We are 
therefore free to make this extension. The next thing to note is that (14.51) 
is just the energy integral 


Η(Π,Φ; v,k) =A(X,T) (14.58) 
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for (14.55) and (14.56). Moreover, in this form it provides only the function 
II(®;»,k,A). 


Without using the relation of II to ®,, which has now been turned into one 
of the variational equations, there is no way to deduce the dispersion 
relation as well. This achieves the required separation of (14.51) into 
information about the form of solutions (now provided by the dependence 
of II on ®) and the dispersion relation. Finally, since the stationary values 
of (14.57) are known to satisfy (14.58), we may restrict the variations to 
functions which already satisfy (14.58). Then (14.57) may be evaluated as 


(v,k,A)= = fIl(®;»,k,A)d®- A, (14.59) 


and II(®;»,k,A) is the function determined from (14.58). The variational 
principle becomes 


δ { £(v,k,4) dx dT=0. 


The variation with respect to A is the only remnant of the variations of ® 
and II. The variational equations are now 


5A: 5, ΞΟ, 
0 


and the consistency relation 


0k Ov 


OT ax 


is added. These are the equations (14.28) — (14.30) with »= —w. 
In the Klein-Gordon example, 
L==~(v?—k’*)83-V(®), 


the Hamiltonian transformation is 


01, 


ὩΣ ΒΡ ate ΜΕΝ 
Ππ-- ; (v? — k?)@,, 

ἰὸς 22 _;217,2_ γ σοῖς ἃ 
H 606, L=5(? Κ΄) Π΄- V(®). 
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The integral H =A is solved as 


I1= (2(r?—k)}'"'(4 — V(@)}', 
and 


e= τ flldo-4 


= (ore) %6U- vO aaa 


in agreement with (14.26). 

Naturally the Hamiltonian transformation can be used also in the 
linear or near-linear cases. The expressions for © may then differ in form 
from those obtained previously, but of course the resulting variational 
equations are equivalent. 


14.6 Comments on the Perturbation Scheme 


The usual procedure in applying perturbation methods is to substitute 
suitable expansions in powers of ε directly into the differential equations of 
the problem, obtain a hierarchy of equations for the successive orders, and 
then take steps to ensure uniform validity. It was in this manner that the 
results of the variational approach were first verified by Luke (1966). The 
expansion (14.39) is substituted in the equation for φ to give equations that 
we may write schematically as 


E,{@9}=0, — E,{@,6} = F,{6}, δῃά so on. 


The zeroth equation for ®® is equivalent to (14.49). It is solved for &®; 
the dispersion relation is obtained between v,k,A, but their dependence on 
X,T is undetermined at this level. The equation for ®) involves only 9 
derivatives of ®” and so is effectively an ordinary differential equation. Its 
solution has unbounded terms proportional to @, unless conditions are 
imposed on F,{®}. These “secular” terms must be suppressed to ensure 
uniform validity of the expansion for large 8. The required condition on 
F,{®} leads to the further equation for »,k,A, which completes the 
lowest order solution. In the subsequent equations for the ®”, there are 
further parameters and further secular conditions. 

The prior requirement that ® be periodic is equivalent to the suppres- 
sion of secular terms. Therefore the successive approximations to the 
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periodicity condition (14.43) would appear as secular conditions in the 
more traditional procedure. We see the advantage of starting from (14.42) 
and (14.43) even if that procedure were to be followed. But, better still, 
since (14.42) and (14.43) correspond to the variational principle (14.44), the 
expansion can be substituted directly in (14.44) and the variational prin- 
ciple used to generate both the equations for ®” and the secular condi- 
tions. Thus the variational approach should not be considered as a 
Separate method. It includes the usual expansion approach, for which it 
streamlines the details and allows general results to be formulated. 

There are other advantages. The variational principle (14.44) has been 
established independently of any assumed form for the dependence on e. 
Furthermore, © may also be allowed to depend on e; it was taken 
independent of ε only for simplicity in the initial presentation. We may use 
expansions in powers of ε for ® or © or both, but we are also free to take 
other forms. For example, in the near-linear case, we may use expansions 
in powers of the amplitude, or, what amounts to the same thing, Fourier 
series for ®. This will be the choice in the discussion of higher order 
approximations in Section 15.5. 


14.7 Extensions to More Variables 


The extension to more space dimensions is immediate. The plane 
periodic wave solutions have p=¥(@) where 0=6(x,t) depends on a 
vector x and the propagation is in the direction of the vector wave number 
k=6,. The average Lagrangian becomes £(w,k,A) and modulations in 
space (i.e., slowly curving phase surfaces) also become possible. The 
modulation equations are (11.80)-(11.82). The justification of the last 
section requires only the obvious changes of replacing x,X,k by x,,X,,k, 
and performing the corresponding summations when necessary. 

The case of a single higher order equation goes through simlarly with 
Only minor extensions. There will be higher order derivatives in (14.31) and 
in all the later steps, but the extensions are straightforward. 

The case of more dependent variables requires detailed comment. 
First, for a linear system in a set of functions p(x, f), periodic wavetrains 
may be described by 


gp’? =a, cos6+ b, 51ηθ. 


The average Lagrangian calculated from this is a function of the two sets 
a,, b,, as well as ὦ and k. The corresponding variational principle 


bf { £(8,,6,,,d.b.) dxdt =0 (14.60) 
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leads to the variational equations 


0 0 »"» | 
0k; dk, 9k, 
Ot 0x; Ox, 9x; 


The set of equations ©, =£, =0 are linear and homogeneous (since & is 
quadratic in a,, ὃ.) and ‘they may in general be solved to express the a, and 
b, in terms of single amplitude a. These expressions may be reinserted into 
the Lagrangian to give © as a function £,(w,k,a), and the modulation 
equations are the same as in the single variable case. The substitution 15 
permissible, since the restricted choice for a, and b, does satisfy the 


stationary conditions. The equivalence may also be verified directly, since 


da, 0b, 


= ἢ a, + Fa Ὁ, =(Q 
aa, ab, 
Ce on dw ἫΝ Oa om τ "πων 


and similarly £, ἘΞ δ" In the course of the substitutions different expres- 
sions for (ὃ) may “result, depending on which relations are chosen, but the 
final equations are the same. The justification via two-timing proceeds as 
before. 

For nonlinear problems, the usual situation concerns a system of 
equations with a corresponding Lagrangian L{9{,p{,p™} involving 
only the y™ and their first derivatives. However, it is typical that for some 
of the φ, only the derivatives appear in L; they are “potentials” in that 
only the derivatives φ,, φ, represent physical quantities. This requires a 
highly nontrivial extension with important mathematical and physical 
consequences. In the uniform wavetrain solution any potential variable ᾧ 
must be expressed as 


@=B-x—yt+O(0), O=k-x—ot, (14.62) 


in order to ensure complete generality. The physical quantities involve only 


ᾧ,Ξ -- γτωῷ», $,=B+ko,, (14.63) 
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and —y,B represent the mean values. These are important physical 
quantities; in water waves they give the mean fluid velocity and mean 
height, for example. Moreover, a most important nonlinear effect is the 
coupling of modulations in the wavetrain to similar slow variations in these 
mean quantities. Thus in the modulation theory the term B-x— yt must be 
generalized to a function O(x, t) and y,f defined by 


(14.64) 


The function @ is similar to θ and is a pseudo-phase appearing in the 
problem. The quantities y and B are a pseudo-frequency and a pseudo- 
wave number. Furthermore, each potential @ has the term L; missing in its 
Euler equation 


0 0 


in the course of the analysis, this always allows an extra integral and an 
extra parameter B to be introduced similar to A. The triads (y,B,B) are 
similar, although subsidiary, to the main triad (w,k,.A). 

The two-timed form corresponding to (14.62) is 


φίχ, ἡ) =e [Θ(Χ, T)+ Φ(θ, Χ, T;€), 
where 


y(X,T)=—-0,, B(X,T)=O,, X=ex, T=er, 
and ® is chosen to be periodic in θ. For a Lagrangian 


L (φ;; Px: Ps Pps Px)s 


it may be shown that the two-timed equations and the conditions that ® and 
® be 2m-periodic in 8, are equivalent to an exact variational principle 
similar to (14.44). To lowest order it is 


sf [ L[L(-06,,ko,, 0, — y—wb,, B κῷ,) dddXaT=0 (14.66) 
0 


The variational equations corresponding to 6® and δᾧ determine the 
functions ® and ® and we have the two integrals 


(—wL,+k-L,—-wLl,+k-L,)®,—L=A(X,T), (14.67) 


—wl,tk-L;=B(X,T). (14.68) 
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The variations 6@ and 8@ lead to the two secular conditions 


0+ dg - 0 > δ -- 
ye ὌΧ. ὍΤΟΥ ox ΓΒ’ 

Finally, a Hamiltonian transformation can be introduced as before, based 
on generalized momenta 0L/0®,, dL /0®,, and (14.67)-(14.68) can be 
used to eliminate explicit dependence on ® and ® in favor of including the 
parameters A and B in the variational principle. We then have 


5f f (w,k,A,y,B,B)dXdT=0, (14.69) 
and the variational equations are 
eC, =9, Ce =0, (14.70) 
ὃ θ ee ΕἸ ΟΝ 5 
57 eo aX, PK Ξε 0, 57 ὧν aX, ee 0, (14.71) 
together with the consistency conditions 
0k; Ow 0B; ὃγ 
ak, ak, ap, ὃ, 
ax, Ox, 7 ox an (14.73) 


Further details of the procedure and examples are given in the original 
papers (Whitham, 1965, 1967,1970). An application to water waves on 
finite depth, where the extra parameters are crucially important, will be 
given in Chapter 16. 

In this more general case, the energy equation that corresponds via 
Noether’s theorem to the invariance of (14.69) with respect to shifts in T is 
now 


ap lwk,+ vl, — £)+ 3 ( ὡρ, — 724) =0. (14.74) 
J 


The momentum equation which corresponds to the invariance with respect 
to shifts in X, is 


0 ὃ 
OT ie Pie gy ὦ α᾿ Ὁ = 0. (14.75) 


The final extension is to note that if the medium is not constant but 
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depends on X,7, these will appear explicitly in the Lagrangian and 
therefore in &. But the variational equations (14.70)-(14.73) are un- 
changed. The conservation equations (14.74) and (14.75), however, have 
terms —£,,£,, respectively, on the right hand sides [as may be verified 
directly from (14.70)(14.73)]. 


14.8 Adiabatic Invariants 


It was remarked previously that the quantities ©,,£&, are analogous to 
the adiabatic invariants of classical mechanics. This correspondence can 
now be explored. In mechanics the setting is the theory of slow modula- 
tions for vibrating systems. The only independent variable is time, so in 
this case the modulations can be produced only by externally imposed 
changes in some parameter A(t). (This corresponds to a varying medium in 
the case of waves.) The classical theory is usually developed by 
Hamiltonian methods, which are not directly applicable to waves, but we 
may instead derive the simplest of the classical results by the methods 
developed here. For an oscillator with one degree of freedom q(t) and one 
slowly varying parameter A(t), the variational principle 15 


t 
| δ( (4, ,λ) 41 =0, 
ty 
and the variational equation is 
—f,—L,=0. (14.76) 


This case is covered by the arguments of Sections 14.3 and 14.4 simply by 
dropping the dependence on x. But it is useful to note the steps separately 
in the usual notation of mechanics. We follow the simple intuitive 
approach of Section 14.3; it is justified by Section 14.4. 

We first calculate the average Lagrangian for the periodic motion with 
A held fixed. If the period is r=27/v, then 


meee ioe 
ea 3 [ Ld. (14.77) 
In the periodic motion (A=constant), (14.76) has the energy integral 


gL,—-L=E. (14.78) 
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This may be solved, in principle, to express ᾧ as a function of (4, £,A) and 
then the generalized momentum p= L, can also be expressed as 


p=p(q,E,A). 
If (14.78) is used to replace L in (14.77), we have 


Vl pete 
c= {μα E 


=> φ p(q,E,A)dq— Ε, (14.79) 


where ¢pdgq means the integral over one complete period of oscillation [a 
closed loop in the (p,q) plane]. We now allow slow variations of A, with 
consequent slow changes of ν and E, and use the average variational 
principle 


δ "© (y, E,A)dt =0. (14.80) 
ἴρ 


It is again crucial to define ν as the derivative 6 of a phase θ() which 
increases by a constant normalized amount in one oscillation. This step 
looks perhaps less natural than in the waves case, but it becomes clear in 
the two-timing. The variations of (14.80) with respect to E and @ give 


d 
= —f£ = 14.81 
respectively. The first of these corresponds to the dispersion relation 


(14.28) and the second corresponds to the conservation equation (14.29). In 
view of (14.79) we have 


aot = 
= ee fp dq = constant, (14.82) 


which is just the classical result of the adiabatic invariant. As the system 15 
modulated, ν and E vary individually but 


Ι 
I(v,E)= 5— Pp dq (14.83) 


remains constant. From (14.79) and (14.81) the period is given by 


τα 55 aI, (14.84) 
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which is also classical. (An excellent account of the usual theory may be 
found in Landau and Lifshitz, 1960b, p. 154.) 

In the two-timed form (14.59), the quantity Π is defined as 0L/0®,, 
whereas the generalized momentum p is 0L/dq,. Since φ,Ξ νῷν, to lowest 
order, we have II= pp and the expressions (14.59) and (14.79) agree. 

It is clear from this comparison that in the case of waves ©, is akin to 
the adiabatic invariant and that the Pi are similar quantities ‘for spatial 
modulations. In waves there is no need for an external drain of energy, 
since modulations in time can be balanced by modulations in space. If the 
medium is not constant, however, we have the additional effect of 
parameters analogous to A, but the equation 


—£,—-~-&, =0 (14.85) 


still holds. The equation has become known as the conservation of wave 
action. 

In the special case of a wavetrain uniform in space but responding to 
changes of the medium in time we have 


¢ Ξ constant. 


Alternatively, for a wavetrain of fixed frequency moving into a medium 
dependent on one space dimension x, we have 


¢, =constant. 


These provide simple determinations of the amplitude. In general, modula- 
tions in space and time balance according to (14.85) and produce a 
Propagation of the modulations. 

The quantities ©, and Ce in (14.71) are similar to ©, and £, . They 
arise because of the extra dependent variables, just as ordinary Avadmical 
systems (involving only the time) may have further adiabatic invariants 
when there are more degrees of freedom. These wave systems have only 
one genuine frequency and so correspond to the degenerate cases of equal 
frequencies in dynamics. 


14.9 Multiple-Phase Wavetrains 
The general case of multiply periodic motions in dynamics would be 


mirrored in wave theory by wavetrains with more than one genuine phase 
function. It is straightforward to extend the formalism to this case but 
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questions of existence suggest caution. For a two-phase wavetrain, for 
example, the starting point would be a quasi-periodic solution 


p=¥(6,,0,), θ,Ξ- Κηχ -- ωγἱ, 0,=k,x—-w,t, (14.86) 


in which Ψ is 27 periodic in both θ᾽) and 6,. One would then go on to 
handle modulation theory as before. However, even in ordinary dynamics 
questions of the existence of quasi-periodic solutions are difficult ones in 
the nonlinear case, involving the well-known problems of small divisors, so 
there may be considerable difficulties hidden under the formalism. If the 
existence of solutions (14.86) and of neighboring modulated solutions is 
simply assumed, modulation equations can be developed as before. Ablo- 
witz and Benney (1970) and Ablowitz (1971) have pursued some of the 
consequences. Delaney (1971) notes that the variational formalism goes 
through. If modulated wavetrains can be described by 


p= 0(6,,0,,X,T;€), 
6,=€ 'O(X,T), 6,=€°'O,(X,T), 
it is straightforward to show that the two-timed equation for ® and the two 


periodicity conditions follows from the variational principle 


ὃ [ [Lax aT=0, 


τς 2a pla 
τες, ᾿ [ Lv, By + νχῷρ, + Dp, k, By + ky By, + εχ, B) dO, dO. 


Modulation equations can then be developed as before. 


14.10 Effects of Damping 


As in Hamiltonian dynamics, the variational formalism applies natur- 
ally to conservative systems; dissipative effects have to be tacked on a little 
awkwardly as nonzero right hand sides to the previous equations. How- 
ever, various canonical forms can be maintained and the left hand sides 
can still be written in terms of the Lagrangian. To illustrate this, we 
consider as a specific example the equation 


Pit — Prxx T γ΄ (φ) = eD(9,q,); 


where eD(q,¢,) represents small dissipative effects. The two-timed equa- 


510 NONLINEAR DISPERSION Chap. 14 
tion corresponding to (14.42) is 


Ι 
ab; 5 (0? k)03 + V(@)— 5702+ ὁ eax 


00 2 


+e. {1} + eb, ®,}- ἐπι {kD} εῷ,Φ.) 
= --εῷ,δ(Φ, νῷ, + €®,). 
To lowest order, we have 
5 (0?— k?)0}+ V(®)= A(X, T) (14.87) 
and the periodicity condition 
wh y@?2 do — ae kb? do = — [ Φρρ(ῷ, νῷ,)αθ. (14.88) 


From (14.87), ®, can be expressed as a function of ®, ν, k, A, and the 
integrals in (14.88) can all be written as loop integrals. We have 


a ee (14.89) 


where 
°(v,k,A) = + (200?) }'G{A-V(®)} "7? do—A, 


as before, and 


Ι 
δ) (ν,Κ,4}- 5- PD(®, &,) do. 
To (14.89) are added 


0k op 
Ὁ, =9, 57. OY = 0, (14.90) 
to complete the set of equations for ν, k, A. Equation 14.89 shows the loss 
in wave action due to dissipation. 

Here we have returned to two-timing on the equations but retained 
the canonical forms suggested by the Lagrangian for the conservative part. 
This is obviously less desirable than two-timing directly some extended 
principle. Recently Jimenez (1972) has had some success in deriving results 
such as (14.89) by Prigogine’s approach to irreversible systems (Donnelly 
et al., 1966). 


CHAPTER 15 


Group Velocities, Instability, and Higher Order 
Dispersion 


Most of Chapter 14 was concerned with questions of formulation. We 
now study the modulation equations and their solutions in some detail and 
emphasize the important differences between linear and nonlinear theory. 
In this chapter we consider the basic case of one dimensional waves in a 
uniform medium and, for simplicity, suppose that pseudo-frequencies and 
wave numbers do not arise. For the present the nonlinear Klein-Gordon 
equation and the problems noted in Section 14.1 can be taken as typical 
applications for the theory. More specific applications in nonlinear optics 
and water waves will be given in the next chapter. Extensions to more 
dimensions, nonuniform media, and higher order systems will be included 
in those particular contexts. 

Modulated wavetrains are described to all orders of approximation by 
the variational principle (14.44). In the lowest order approximation we 
have (14.47)-(14.48), and by means of the Hamiltonian transformation we 
obtain the average variational principle 


ὃ [ [ &(w,k,A)dx dt =0, 


where w= — 0, and 6,=k. (We drop the two-timing notation and revert to 
the earlier form, except when the precise ordering of terms again becomes 
an issue.) In this lowest approximation the variational equations for A,w,k 
are 


e, =0, (15.1) 
9 Od. 
0k , Ow _ 

ry ser eta (15.3) 


We first study these equations and then return to (14.44) to incorporate 
higher order dispersive effects. 
511 
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15.1 The Near-Linear Case 


Before the main discussion, we note how the near-linear equations 
obtained directly in Section 14.2 fit into the general formalism. The 
near-linear theory is obtained by expanding £ in powers of the amplitude. 
This expansion may be taken as 


2 = P(w,k)A + Ρ,(ω,Κ)43- e+. (15.4) 


but it would usually be derived from Fourier series, as in (14.52), in the 
equivalent form 


Ρ =G(w,k)a?+Gy(w,k)at+-+-. 
The dispersion relation ὃ, =0 is solved to give 
W=W(k)+w,(k)a7+---, 
where 


2G, (wo, k) 


G(w,k) =0, W> = G ἴω k) . 
w τ Ὁ: 


Equations 15.2 and 15.3 become 
9 0 ; 
9; { g(k)a*+---}4 ax { g(k)wg(k)a*+--- } =0, 


0k 


0 
τς {Wo(k) +w,(k)a*+--- } =0, 


where g(k)= G,,(wo,k) and the relation 


; =o. G;,( Wo, k ) 
wo(k) = δ, (wok) 


for the linear group velocity has been used. The coefficient g(k) can be 
factored out, in view of the second equation for k, and a sufficient 
approximation, as explained in Section 14.2, is 


ΠΕ ON area =0 
dx \ 


0k . 47, 0k da* _ 
5; + (Kk) se + w(k) ax Ξε. 
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The characteristic equations were found to be 


1/2 
wy (k 
1 are | (sgn w”) dk + da=0, 
(15.5) 
ΖΚ πος (1): (ωγ(γωρ" (k) } a. 


If (15.4) is used, we have the equivalent results with a replaced by A!/? and 
G,G, replaced by P,P, in the definition of w,. 
We now consider the exact equations (15.1)-(15.3). 


15.2 Characteristic Form of the Equations 


There are two useful forms of the characteristic equations depending 
on whether the symmetry between ¢ and x variables is maintained. If 
symmetry is maintained, it 1s convenient on this occasion to work with θ 
rather than w and k. Then (15.2) becomes 


ὌΝΟΝ 0, -2 Cook 6... + β κα θ.. " ey A, + CA A, =0. 


The derivatives A,,A, may be eliminated in favor of θ from (15.1) for, on 
taking its ¢ and x derivatives, we have 


is Sap 0, + OKA 6.x + CAA A, = 0, 
Cod θ. Bs KA θ. τ AA A, =0. 
The second order equation for θ᾽ becomes 


po, —278,,.+ 90. =9, (15.6) 
where 
P= Lead — Cas 
G= Cy Ban — Lia, 
r= Cage, 


Coy. 


From (15.1), A may be expressed in terms of 0,,6,, and then (15.6) is a 
second order quasi-linear equation for 9. The characteristics are 


dx _ ~rtVr'—pq 


dt Ρ 
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In the linear case 
& =P(w,k)A, 


p=-—P; q=— Pi, RE kes 


and we have the double characteristic velocity 


this is just the linear group velocity. The near-linear results can be 
recovered similarly. 

If x,t symmetry is abandoned, one useful possibility is to choose k 
and J= 2, as the dependent variables and suppose that 


πῇ,» 7Ξ--ῦ,; C4, =0 
are solved for the functions 

w(k,I), J(k,T), A(k,I). 
Then £ may also be evaluated as 


MW (k,I)= L {w(k,J), k,A(k,I) ie 
We have 


NM, =0,I-J, MW, =a,I; (15.7) 
hence from ,, = My, 
w,=J,. (15.8) 
The system of equations (15.1)-(15.3) reduces to 


k,+w0,k,, + 1, =9, 


(15.9) 
I+w0,1,+J,k, =9. 
The characteristic equations are 
VJ, dk+ Vw, dl=0 (15.10) 
on 
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This choice of variables keeps a much closer relation with the earlier 
discussions of the linear and near-linear cases. 

In the linear case, © = P(w,k)A, the dispersion relation P(w,k)=0 15 
solved in the form 


ὠτεωρ( ἈΚ), 
and we have 


Plo 


wW 


Since w,=0, the characteristic velocities (15.11) both reduce to wo(k). The 
system is not strictly hyperbolic, as noted earlier, because there is only one 
differential form, dk=0, provided by (15.10). However, once k(x,t) has 
been found, J is obtained from integration of 


1 +0)(k)I, +0% (k)Ik, =0 


along the same characteristics. 
In the near-linear case with & given by (15.4), we have 


0, =P(w,k)+2P,(w,k)At+--: =0, 
I=f,=P,(w,k)At:::, ὑπ --ῦ, Ξ -- διίω,Κ)4 Ἐ. 


These may be solved to give 


W=W(k)+0,(k)I+---, 
(15.12) 
J=ug(k)I+-:-. 


The characteristic velocities (15.11) are 


PE = (ΚΑ) + Vers (Κγῶχ(Κ)}1 boo (15.13) 


These check with (15.5), for if J=g(k)a’, then 


W=Wo(k) +w(k)a?+---, 
with 


w,(k) =32(k)o,(k), 


and (15.13) becomes the same as (15.5). 
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Hayes (1973) notes that if a partial Hamiltonian transformation 
K(k) =wl,—-& =wl— & (15.14) 
is introduced at the same time as k and I, we have 
ὑπ, w=K,. (15.15) 


These may be seen also from (15.7) with St τε ὦ] -- KC. The equations 
become 


0k , 90 _ 
apt ae JC, =, 
(15.16) 
᾽ς C20 
δ ax” * 
The characteristic equations are 
V JG aka V ὦ, al=0, 
(15.17) 


d. y/ar ar 
a= Wry, Hay Sy - 


In specific cases, other choices of variables may lead to the simplest 
expressions. For the Klein-Gordon example (14.26), 


β =(w?—k?)' F(A) —-A, 
(15.18) 


F(A)= -ἰ G{2(4-V(¥))} Pa: 


it turns out that the phase velocity U=w/k and A are the most convenient 
variables to work with. From the dispersion relation & , =0, 


_ _ U | 
(U?-1)'? F(A)’ (U?—1)'7 F(A)’ 


equations 15.2 and 15.3 become 
οι νι, 8ὲ᾽ ὦ 
or (υ-- 1)? Ox (U?— i 


δ τα ad [πρὶ 
δ 1 (υ3-- 6} ὃχ ((υ2-.- 2 Ρ’ 
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The characteristic equations are found to be 


1/2 
dU__(_F" 
i +(-4) 44-ὸ (15.19) 


dx 1+ U(- FF"/F?)” 


(15.20) 
at U+(-F m | pry? 


More Dependent Variables. 


When there are more dependent variables and more equations, as in 
(14.70)-(14.73), the number of characteristics is increased corresponding to 
the order of the system. The additional characteristics refer to nonlinear 
coupling of the wavetrain with changes in mean background quantities and 
are quite different from the linear group velocity. The formulas for those 
two velocities (associated primarily with propagation of k and A) that do 
correspond to the linear group velocity are considerably modified. In 
particular, in these cases, the type may be incorrectly given if the extra 
dependence is overlooked and the simpler formulas above are used. 
General formulas for the characteristics will not be developed, since the 
most useful choice of variables depends strongly on the particular problem 
at hand. Typical examples are provided by the later discussions of the 
Korteweg-deVries equation and Stokes waves in water of finite depth. 


15.3 Type of the Equations and Stability 


The type of equations can be read off according to whether the 
characteristics are real or imaginary. The condition for the system to be 
hyperbolic may be taken in any of the equivalent forms 


r?—pq>0, w,J, >9, Dg: nee > 0, (15.21) 


the second being closest in form to the near-linear result w,wg >0. With 
opposite signs the system 1s elliptic. 

As remarked in Section 14.2, the periodic wavetrains are unstable in a 
certain sense when the modulation equations are elliptic. To see this, we 
note that the modulation equations take the general form 


ie ot 9 15.22 
a tai z= . ( : ) 
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In a uniform periodic wavetrain u takes constant values u, say. For small 
perturbations we take u=u©+u. The linearized equations for u are 


du) dus? 
: (). 7 — (0) — (0) 


This system has solutions 


wa e ἱμία -- ο 
where 


la — C8,,| =0. (15.23) 


The possible values of C are the characteristic velocities evaluated for 
u=u [see (5.12)]. If any of the C’s are complex, some of the solutions of 
u” grow exponentially in time. Of course as usual in simple linear stability 
analysis this only indicates large deviations from the uniform state will 
occur, not necessarily that the wavetrain becomes chaotic. In the present 
context, the stability and the possible eventual states are considerably 
affected by higher order terms in the modulation approximation, as will be 
discussed in Section 15.5. 

For the nonlinear Klein-Gordon equation (15.20), for wavetrains 
satisfying (14.6) with F(A)>0, we have 


Hyperbolic: Fe «0, 
(15.24) 
Elliptic: F”>0. 


In particular, when V(V¥)=4¥7+ 0", the system is hyperbolic for o>0 
and elliptic for σ «0. For any even function V(), the first terms of the 
near-linear expansion can be put in this form and the type depends in the 
same way on the sign of o. 

For the Sine-Gordon equation V(V)=1—cos¥, it may be shown that 
F"(A)>0; the periodic wavetrains are unstable. This result applies to 
oscillations about ”=0, which satisfy (14.6). We shall later note the 
existence of helical wavetrains in which WV increases or decreases mono- 
tonically. These give periodic solutions, since the same physical state is 
recovered after each change by 27. They turn out to be stable in the sense 
considered here. 
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15.4 Nonlinear Group Velocity, Group Splitting, Shocks 


In the hyperbolic case, the characteristic velocities are taken to define 
the nonlinear group velocities. This is the natural extension of the linear 
case. The splitting of the double characteristic velocity of linear theory into 
two different velocities is probably the most important and far-reaching 
result of the theory. As noted in Section 14.2, it predicts the eventual 
splitting of a modulation of finite extent into two separate disturbances, a 
result quite different from linear theory. In problems where the linear 
group velocity is positive, the two nonlinear group velocities will usually 
both be positive also. Then modulations introduced by a source at the 
origin will propagate on both families of characteristics as shown in Fig. 
15.1. Ideally, we take a source at x=0 producing a highly nonlinear 
wavetrain up to ¢=0; it then modulates amplitude and frequency for a 
finite time 7, after which it returns to producing the original wavetrain. 
Notice that two boundary conditions should be applied at x =0, so that 
independent distributions of a and w may be introduced. By the usual 
arguments of Part I, there will be a certain interaction period, but the 
disturbance will ultimately separate into two simple waves on the C, and 
C_ characteristics as shown. This is analogous to Riemann’s initial value 


problem (Section 6.12). 


simple wave on G_ 


simple wave on Gy 


x 


Fig. 15.1. Group splitting. 


We can make an estimate of the distance to separation in terms of the 
difference between the characteristic velocities. We have 
ς»- 
τς ὦ 


ΧΕ 


where C.. are typical values of the characteristic velocities and fy is the 
time the modulation lasts at the source. The velocities in the near-linear 
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theory (15.5) provide the upper estimate 


12 
Wg tq 


nal) 1/2 . 
»Ἤ 
2α(ω,ωῤ 


(15.25) 
It would be extremely valuable to have experimental evidence of this 
separation, since it is of fundamental importance in assessing the modula- 
tion theory. Other related nonlinear effects have been observed in non- 
linear optics, but this one does not appear to have been pursued yet. 

Simple wave solutions, produced as described above or otherwise, can 
be obtained analytically by the standard theory of Part I. One Riemann 
invariant is constant throughout, and the modulation variables w,k,a 
remain constant along the individual characteristics of the appropriate 
family. In the linear theory, k remains constant but axr7'/? along the 
characteristics. This difference between nonlinear and linear behavior is 
probably less easily detectable than the group splitting, and may be 
partially masked by higher order effects. 

Finally, among the group of hyperbolic topics, we have the question 
of breaking and shocks. The dependence of the characteristic velocities on 
the modulation variables introduces the usual hyperbolic distortion, and 
“compressive” modulations in a simple wave solution will develop multi- 
valued regions. What happens then is not clear at present. Unlike the 
problems treated in Part I, there is no objection to multivalued solutions as 
such. They would be interpreted as the superposition of two or more 
wavetrains with different ranges of k and a. The actual solutions would not 
be described correctly by (15.1)-(15.3), since these equations were derived 
presupposing a single phase function. But they would presumably be 
covered by the original equation. Certainly superposition is what happens 
in the linear case. Although the question was not raised in the earlier 
discussions, it is conceivable to set up a wavetrain with the linear group 
velocity C,(k) decreasing toward the front. Then, since values of k propa- 
gate with velocity C(x), there would eventually be overlapping parts of the 
wavetrain. There is no problem about superposition in the linear theory 
and the whole process could be studied by the exact solution in Fourier 
integrals. Although the corresponding process would be hard to follow 
through analytically in the nonlinear case, this qualitative behavior seems 
perfectly feasible. Something like the multiphase solutions referred to in 
Section 14.9 would be needed within the overlap region, but the transition 
process poses a difficult problem. 

A second possibility is that higher order terms in the modulation 
approximation become important near breaking and prevent the devel- 
opment of a multivalued solution. It is easy to see in general (and will be 


“--ο. 
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shown in some detail in the next section) that higher order effects typically 
introduce additional terms involving third derivatives into (15.2) and 
(15.3). They become similar in form to the Boussinesq and Korteweg- 
deVries equations. By analogy, it is expected that breaking is suppressed 
by the extra terms. Of course, just as in the case of water waves, the 
additional terms are introduced as small corrections for long length scales, 
and are the first terms in an infinite series of higher derivatives. To accept 
their dominating effect on breaking in all cases may be inconsistent. It is 
more likely that this applies to small symmetric modulations and these 
develop into a series of solitary waves, whereas strong unsymmetrical ones 
break in some sense. 

Finally, we have the question of shocks. Formally, discontinuities in 
w,k,A can be allowed in the solutions of (15.2) and (15.3). These would be 
interpreted as weak solutions and the shock conditions would be taken 
from appropriate conservation equations as described in Section 5.8. This 
is the most fascinating possibility theoretically, but it is probably least 
likely as a description of reality in this particular context. The unsureness 
as to interpretation also makes the choice of appropriate shock conditions 
less clear. Equations 15.2 and 15.3 are already in conservation form, but 
the equations for conservation of energy and momentum are also obvious 
candidates. The latter are 


3 3 ᾿ 
τωρ, - δ)- = (k,)=0, (15.26) 


3 3 Ε 
(ke) τ (δι - Y=. (15.27) 


There are, in fact, an infinite number of conservation equations. However, 
(15.2), (15.3), (15.26), and (15.27) are the only ones with clear significance. 
Our system is essentially second order, [(15.1) is not a differential equa- 
tion], so two conservation equations must be chosen to provide the two 
shock conditions. The choice is tied to what the shocks are supposed to 
represent. If they are taken to be approximations to solutions still covered 
by the original detailed equation for g, then we should choose shock 
conditions from (15.26) and (15.27), the argument being that energy and 
momentum are conserved in the more detailed description for g and 
should therefore be retained in the slowly varying approximation. The 
shock conditions would be 


U,[wof,, — &]+[wl,]=0, (15.28) 


U[kEJ+[K£, — £]=0, (15.29) 
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where U, is the shock velocity. Conservation of phase, (15.3), and conser- 
vation of wave action, (15.2), could not then be maintained across such a 
shock. These were derived for a special form of solution, assuming slow 
variations, so there is no objection to abandoning them across abrupt 
shocks. These shocks would therefore represent a source of oscillations and 
involve jumps of the adiabatic invariants; the latter brings to mind the 
quantum jumps of the adiabatic invariants in quantum theory! It should 
be emphasized again that this is all just formalism with no positive view as 
to the structure of these shocks or even need for their occurrence. 
Moreover, these discontinuities would have irreversible properties, yet the 
original equation for φ is reversible. This would be another example of the 
long-standing problem of how systems reversible in some fine scale of 
description can exhibit irreversibility in “macroscopic” levels of approxi- 
mation. 

If, on the other hand, discontinuities are supposed to represent 
phenomena not covered by the original equation, but covered by some 
even more detailed description involving dissipation of some kind, then the 
choice would be different. Although momentum would probably be con- 
served, energy presumably would not. It is unlikely that (15.2) is the 
correct alternative, but one could make a case for (15.3). With dissipation, 
smooth oscillatory changes between different constant states may be con- 
structed in dispersive models, as shown in Section 13.15. In that case the 
end states are constant, since the dissipation also damps out the oscilla- 
tions on the two sides of the transition region. Thus it does not represent a 
change of state within an oscillatory wavetrain as contemplated here. But it 
does indicate the existence of a single-valued phase function and adds 
weight to the choice of (15.3) for a possible shock condition when dissipa- 
tion is involved. Again this is entirely speculative and it would be pointless 
to go further in this direction without some more definite information and 
results. 


18.5 Higher Order Dispersive Effects 


We now consider modulation equations in the next order of 
approximation beyond (15.1)-(15.3). For simplicity, we work with the 
example 


Pn — Pex + P+ 40H° =0 (15.30) 
and limit the analysis to the near-linear case. But the results are typical and 


more specific physical applications will be shown for nonlinear optics in 
Section 16.4. 
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The variational principle for (15.30) has the Lagrangian 


ai ent ee oe «τὴ 
L= TO τ Px FP σφ᾽ (15.31) 


and the exact modulation equations are given by 


ὃ { { Lax dT=0, (15.32) 
— 1 τὰ l 2 1 2 1 .} 4 
beg} | 5 (wy + 7) — 3 (kb, + by) — 5 — σῷ | db, 
(15.33) 
as shown in (14.44). (We revert to the usual frequency w= -- ν.) For the 


near-linear theory we may use the Fourier expansion 


® = acos@+a,cos39+a,cos5é+ ---, 


as in deriving (14.52). But we now retain also the next order terms in e. The 
coefficients a, are proportional to a”, and this case is particularly simple, 
since to the order of approximation needed here only the term acos@ 
contributes. We have 

L= i ω;- k*?—1)a*— 2 σαν 1<(ah— αἱ) +0(a°,e7a*). (15.34) 
This is a double expansion in which ε and a are assumed to be of the same 
order.* The linear term is 1(w*— k?— 1)a’, the first correction of nonlinear- 
ity is — 30a*, and the first correction of higher order dispersion is 4€*(a7— 
a2). The variational equations are 


δα: (w*— k?— 1) a—30a°— ε᾽(α,1-- αχχ)Ξ0, (15.35) 


Di Hn ng On ω»,32)-- 
50: 57 (wa )+ ay (ka y=0, (15.36) 
: dk , dw _ 
Consistency: oT + ay 0. (15.37) 


It should be noted from (15.35) that @(X,7) now depends also on e, and 


*Equivalently one could leave a as an 0(1) quantity, take o as a measure of the nonlinearity, 
and use a double expansion in the small parameters o and εὖ. 
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that a rigid separation into a hierarchy of equations for the different orders 
of « has not been imposed. The variational principle (15.32) is exact and we 
have merely implemented it to the order of approximation noted in (15.34). 

In this particular case, the higher order modulation equations are 
more complicated in form than the original equation (15.30)! Nevertheless, 
the behavior of modulations can be seen more easily from them than from 
the original equation. Of course it is usually the case that there is consider- 
able simplification in going to the modulation equations. In any event, 
(15.35)-(15.37) are typical for systems in general. 

When the terms in ε are omitted, the modulation equations are 
hyperbolic for o>0 and elliptic for «<0. The higher order dispersion 
effects introduce third derivatives of a into (15.36)-(15.37), and the modu- 
lation equations themselves become dispersive. In the case o >0 they have 
a similar structure to Boussinesq’s equations. The consequences for break- 
ing are expected to be similar, as discussed in Section 15.4. The existence 
of periodic solutions and solitary waves will be noted shortly. First, 
however, we consider how the instability found in the elliptic case σ <0 is 
affected by the additional terms. A uniform wavetrain is a solution with 
constant values ow, 4, a, satisfying the dispersion relation (15.35). For 
small perturbations w, ka, about these values, the linearized equa- 
tions (15.35)-(15.37) are homogeneous with constant coefficients involving 
wk, a©. There are elementary solutions with w,k,a™ all propor- 
tional to e*“~ ©"), provided C satisfies 


(oy 2,,2 
(ωῦς--κογ-α- cP a + “τι = c)| =0. (15.38) 


The parameter » determines the wave number for the modulations. For 
small a® and ε, the values for C are 


© 1 (30a εἾ2)" 
Ξ τες (15.39) 


C=5_+—_ 
wo ~ 2 2 Ag? 


With the term in p neglected these are just the characteristic velocities, 
imaginary for o<0. The influence of the dispersion introduced by the 
correction in p is stabilizing, and the instability is now confined to the 
range 


0«εἶμ < 6lalw72q (15.40) 


in the modulation wave number ep. 
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For both the cases o >0 and o <0 it is important to observe that the 
system (15.35)}(15.37) has steady profile solutions propagating without 
change of shape. They are found in the usual way as solutions in which all 
quantities are functions of a moving coordinate Χ — VT. We have 


e2(1— V2) a" + (w*— k?—-1)a—30a*=0, (15.41) 
(wV—k)a’=R, (15.42) 
w- Vk=S, (15.43) 


where R and S are constants of integration. The last two may be used to 
eliminate w and k in the first to give 


R?— S7a*+ (1— V?)(a*+30a°) 


a> 


e(1—V?)’a"= (15.44) 
In general, there are periodic solutions (wavetrains in the envelope of the 
original modulated wavetrain!) in which a oscillates between two simple 
zeros in the numerator of the right hand side. Solitary waves will be 
limiting cases. 

The solitary wave with a0 as X-»+ 00 is particularly interesting; it 
represents a wave packet as shown in Fig. 15.2. In this case, (15.42) has 
R=0, since a—0 at oo. Hence 


: (15.45) 


then, in turn, from (15.43), w and k are both constant. For this example, 
the linear dispersion relation is w)=(k7+ 1)'/2 and the linear group veloc- 
ity 15 


Cee i 


ἢ (k*+ ie 7 Wo 


We see that the velocity V is a nonlinear counterpart of Cp; it will be close 
to it for small amplitudes. Since C)<1, we may take V<1. With w,k 


Fig. 15.2. Solitary wave modulation. 
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constant, (15.41) for a integrates once to 


2(1—V2)a?= 5 oa*— (w?— k?=1)a?. (15.46) 


For a—0 at oo, we need 


w?—k?-1<0. 


At the maximum value of a, we have 


w= 1= S002, (15.47) 


m 


so that solitary waves of this type exist only for the elliptic case o<0. The 
velocity (15.45) of the packet is 


~ kk 3.» lela 
(κι 4 (24177? 


it moves a little slower than the linear group velocity. Ostrowskii (1967) in 
an analogous problem in nonlinear optics proposes that the result of 
instability may be a periodic solution which is essentially a sequence of 
such wave packets. 

In the hyperbolic case o >0, these extreme solitary waves do not exist, 
but periodic wavetrains and solitary wave solutions of (15.44) with a 
bounded away from zero can be found. This is reasonable, since for o >0 
the modulations distort in the hyperbolic theory (e=0) but do not grow. 
The higher dispersion can counteract the distortion to produce steady 
profiles, and there is no reason for them to grow to the extreme case with 
a=Q. Their existence supports the view that in some cases breaking will 
not occur but rather the waveform will tend to a steady oscillatory form in 
the breaking region. 

In near-linear theory the higher order dispersive terms arise from the 
quadratic part of the Lagrangian and it is easy to show that the general 
form corresponding to (15.34) is 


L = G(w, k)a? + G,(w,k)a* + se G 07 +2G καγαν + Gaz}. 


The variational equations are similar to(15.35)-(15.37). 
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15.6 Fourier Analysis and Nonlinear Interactions 


If the amplitudes are small and if only a few Fourier components are 
present, the nonlinear interactions between the components can be studied 
directly. This provides an alternative approach to some of the foregoing 
results. It was by this approach that Benjamin (1967) found an instability 
result of the type given in (15.40) for Stokes waves in deep water. The 
details for water waves by both the modulation and the interaction 
approaches will be discussed in Section 16.11. Here, to illustrate the 
method, we apply Benjamin’s arguments to the Klein-Gordon equation, 
where the algebra is simpler. 

For a finite number of Fourier components, φ can be expressed in the 
form 


φ- ἢ De(ne™, (15.48) 
where ν runs over +1, +2,..., +N and we take κ᾿, Ξ -- κ,» Φφ. ,κ()Ξ φί(η, 
n=1,...,N, to ensure a real gy. For the equation 

Pir Pxx rp 4σφ", (15.49) 


the linear solution (which neglects the right hand side) is 


g,(t)=A,e "ἢ, (15.50) 
where 


w =(n2+1)", w.,=-0, A_,=AS, (15.51) 


and the A, are constant. The near-linear theory can be developed by 
assuming o to be a small parameter. 
A naive perturbation expansion would express p as a power series 


p= 9 +09 + op + - 


to give the hierarchy 


G.-Y Ἐφ =0, (15.52) 


φί" -- φί +g? = -- 49, (15.53) 
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and so on. The solution of the linear equation ( 15.52) would be taken in 
the form 


g= 5 3A, εἰναι τίωμ 


and substituted in the right hand side of the equation for φί, However, 
resonance produces secular terms in gp“ and the expansion is not uni- 
formly valid. This is, in fact, just a more general version of the situation 
noted while discussing the Stokes expansion for periodic solutions in 
Section 13.13. A uniformly valid solution is obtained by including any 
third order resonant terms at the earlier level in the equation for @®. It 
amounts to adopting a more definite Fourier analysis viewpoint and 
grouping terms according to their contributions to the different com- 
ponents e*. That is, we substitute (15.48) in (15.49), and for each of the 
original components οἷ we have 


+up,=-5 Σ᾽ φ,φρφ,, (15.54) 


dt* Κα + kg t ky =k, 


where w, is the same as in (15.51). The cubic term will also generate new 
components to be added to (15.48), but they do not resonate (at least at the 
cubic order) and may be neglected in the first approximation. We therefore 
consider the solutions of (15.54). 

To take out the main oscillation, we introduce 


g(t) = A,(t)e~'*", 


with the important difference from linear theory that A,(t) is still a 
function of t. We have 


d*A dA | 7 
eae ς — 2iw, — =-—g > A,AgA, eller Θὰ τ ὡρ ων} 
dt” dt Κα tkgt+k, =k, 


The time scale of concern is now θ(σ΄ ἢ) and each ¢ derivative introduces 
an order o. Hence it is sufficient to drop the second derivative of A, and 
take 


Pe FA Ag Ace) (15,55) 


dt 2w, Ky + kg + kK, =k, 
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True resonance occurs when 


κ Ἔκ, ἜΚ. Ξ κὶ. 
ee ae (15.56) 
W, + We τω, = w,. 

The self-interaction 
(k,,K,, —K,)kK, 


po “sp? 


(and its permutations) are always of this type. They produce the Stokes 
effect on the frequency. If only one mode κρ were present, we should have 


44. 316 δ ς 7 1/2 
---- Ξ-:- -- ------ = + 
dt Deo, 1040 Wo (κὸ 1) ) 
with solution 
Ao = age ~ 8i2/2#00A0!, (15.57) 


Then 


3 σαῇ 


This is the Stokes result (14.12). 

In his stability discussion, Benjamin considers the effect of close “side 
bands” with wave numbers x,+ ji on a main wave kg. That is the set of ,,, 
n=1,...,N, is {ko,ko—fi,Kg+ ji}, and their negative values are added to 
make up the full set k,. We denote the corresponding A, by Ay, A_, A,; 
their conjugates appear as in (15.51). The equations (15.55) become 


dA | | 
—9 = ἰδ (34248464 9A ,A% +649A_A* Ἐ6ΑΆζ4..4. 6"), 
at 2ωρ 
(15.58) 
dA _ | | 
Go τ ἰδάράξΆ. + 345A Se +64 ASA _ +3A42,A*}, (15.59) 
where 


Q=w, ἕω. — 2p; (15.60) 


the equation for A, is obtained by interchanging plus and minus sub- 
scripts. The interactions involved can be seen from the A’s that contribute. 


530 GROUP VELOCITIES AND HIGHER ORDER DISPERSION Chap. 15 


The first term in (15.58) is the self-interaction (Stokes) term; the second 
term comes from (kg, κο + jl, — Ky — jt) —>Kp, and so on. The terms with the 
factor οἰ do not exactly satisfy the resonance condition on the frequen- 
cies. But if Κὶ is small, so is Q, and the factor is kept to preserve uniformity 
as 0. The numerical coefficients in front of each term give the number 
of permutations in any particular interaction. 

In a stability analysis, it is assumed that A, «4, and the equations 
are linearized to 


44. 31σ 24x 
Ge gg AO 
dA 


= id i 
ns {6A,AgA_+3A45A*e@} | 


The effects on Ay are second order. We take 


Ag dei, ρ-- 3 (15.61) 


as in (15.57), with a, real. For small ji it is sufficient to take ως ~W, in the 
coefficients of the equations for A. and to approximate Q by 


Q = w6 (Ko) ji. (15.62) 


The linear equations for A. then become 


dA 
eet 3 i(Q—2p) 
᾿Ξ ip{2A_+A*e ὦ" 
dA, | 
es i(2—2p) 
a ip{2A,+A*e ἀμ 


These have solutions in the form A, =a, οἰλεί, where A. satisfy 


2+ (2p — Q)A+ p(p —22) =0. (15.63) 


The small side band perturbations grow if the roots of (15.63) are complex; 
that is, if 


(0+ 7 }@<o. (15.64) 
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For this particular example, w)=(x2+ 1)'/?, wg =w9 ", So (15.64) gives 


“2 
δι 2 ee 
3 oct E\<o 


This agrees with (15.39) and (15.40), since f= ep in the comparison. For 
o >0 the side bands always remain small. For σ <0, there is instability for 
the range 


jn? «.6δ]σωξαΐ. 


Again this is a linear instability theory; the nonlinear equations (15.58)- 
(15.59) conserve energy, which oscillates between the modes. This agrees 
with the proposal that the end result is a solution with finite amplitude 
oscillations. 

It should be fairly clear that the preceding analysis, although devel- 
oped for a specific example, is general. In fact, we can recognize from 
(15.61) that p is always the Stokes correction to the frequency; in Section 
14.2 it was denoted by wa”. The expression (15.62) for is already in a 
general form. Therefore the criterion (15.64) may be written 


(2a? + τως fi? eg β΄’ <0. 


This should be compared with the radical in the characteristic velocity 
(14.21); the extra term in ji” arises from the higher order dispersion effects. 

The interaction and modulation approaches may be compared by 
noting that the modulation k=k%+k”,a=a%+a™, used to obtain 
(15.39), may be expressed approximately as 


p= : {a +a lexpi{9©+ 0 } + complex conjugate 
om 5a exp 19 + 5 aPexp 19 + 510 a exp id © + complex conjugate. 
(15.65) 
If we now take 
a Sas, 9 = kyx — (wot p)t 
for the basic wavetrain, and express the perturbations a‘?,@ as 


appropriate linear combinations of e*', we obtain the side band descrip- 
tion. 
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The interaction theory can be carried through effectively only for the 
near-linear case and only for modulations consisting of a finite number of 
Fourier components. Even so, we note the algebraic complexity compared 
with the modulation approach. This can be mitigated to some extent by 
appealing again to the variational principle. If the expression 


p= 5 DA, (deme 


is substituted into the Lagrangian, all terms except the resonant ones will 
be oscillatory in x. If these are averaged out, we may use the variational 
principle to obtain the equations for the A,. In the simpler cases, such as 
the one above, the saving is not very great. We fairly easily obtain the 
average Lagrangian 

L= 


ΜΝ non Ἢ 


>» {4,47 — ἴω, 4,A* + ἴω, A,A*} 


μι π΄ ἢ 
men 


- 2163 ΑΣΑ +24 DD A A*A 41) 


- τε [12424".45 ei + 124374 Ave (15.66) 


where the summations are over A,,A,,A_. But the remaining analysis of 
the variational equations is much the same. The main advantage is the 
canonical form. 

The interaction theory is not restricted to neighboring sets of wave 
numbers. With sufficiently general dispersion relations, one can have 
resonances satisfying (15.56) for widely differeing «,. There is then strong 
energy transfer between these modes. Such cases are discussed by Phillips 
(1967) and references given there. A problem in nonlinear optics (Section 
16.5) will provide an illustrative example. This type of interaction between 
widely different wave numbers ties in closely with the multiple phase 
solutions indicated in Section 14.9. 


CHAPTER 16 


Applications of the Nonlinear Theory 


NONLINEAR OPTICS 


16.1 Basic Ideas 


One of the most interesting areas for the study of nonlinear dispersive 
effects is in the field of nonlinear optics. The theoretical ideas fit in 
naturally with the experimental situation and are involved in the deve- 
lopment of practical devices. Modulation theory is the natural approach to 
a number of phenomena in view of the high frequencies and wave numbers 
of the basic wavetrains. The self-focusing and stability of beams are 
studied in this way. Nonlinear interactions for the production or amplifica- 
tion of sum and difference frequencies are important and can be displayed 
dramatically by changing the color of a laser beam on passage through a 
nonlinear crystal. The experiments generally seem to be more easily and 
precisely controllable than is possible, for example, in water waves, where 
the many modes of fluid motion make it difficult to isolate the particular 
effects desired. 

In the simplest formulations of the theory, the analysis is very close to 
that of the Klein-Gordon example and results can be taken over by 
analogy from that case. We start with the classical model in which the 
electric polarization of the material is represented as the displacement of 
bound electrons by the electric field. The results can be interpreted in a 
broader way later. We consider the basic one dimensional wavetrain and 
take the propagation to be in the x direction with field components FE and 
B in the z and y directions, respectively. The electron displacement is in 
the z direction and we describe it by the function r(x,7). Maxwell’s 
equations reduce to 

B,— E,,=9, 
cee (16.1) 


x? 


N 
Et fr, =coB 
£9 
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where q is the electronic charge, N is the number of electrons per unit 
volume, cy is the speed of light in vacuum, and ερ is the permittivity of free 
space. To complete the system, we need the relation of r to E. We suppose 
that an electron driven by the field E is effectively in a potential well that 
provides a nonlinear restoring force. Accordingly, the equation is taken in 
the form 


mr, + U(r) = gE. (16.2) 


If the polarization P= Ngr is introduced and B is eliminated from Max- 
well’s equations, we have 


E,,+ +P, Ξε οὐ...» (16.3) 
0 
ΡΜ (ΡΊΞΕ, (16.4) 
where 
_9N »_ Na? | 
ViP)=—-Ur), = ae (16.5) 


v, is the plasma frequency. 

Each oscillator responds also to the cumulative effect of all other 
oscillators. In the elementary treatment, this is incorporated by replacing E 
on the right hand sides of (16.2) and (16.4) by E+ P/3¢,, which is the field 
inside a spherical cavity surrounded by dielectric with polarization P. For 
our purposes we can suppose the extra term is absorbed into V’(P). 

There are important simplifications in the analysis if the potential well 
is symmetric and V’(P) is an odd function of P. We shall take this to be 
the case for the most part and indicate at appropriate places what is 
involved in the more general case. 


Uniform Wavetrains. 


In a uniform wavetrain E= Ε(θ), ΡΞ P(0),6=kx—wt. Equation 16.3 
then integrates to 


(w? — cok?)eyE = — w*P + constant. (16.6) 


If V’(P) is an odd function of P, we may take the constant of integration 
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to be zero without loss of generality. In other cases, however, the constant 
may be required and plays an important role. If V’(P) has a term in P?, 
for example, we see from (16.4) that the mean values of E and P differ by 
a term quadratic in the amplitude; hence a constant proportional to the 
amplitude squared is required in (16.6). If V’(P) is odd, however, we note 
that it is consistent to take zero mean values for E and P, and with this 
assumption the constant in (16.6) must be zero. Then we have 


ES Pp 16.7 
“0 w* — cok? US) 
*Payt V'(P) π' Ὁ P=0 (16.8) 
+ V’ + Ξε. . 
τ ωἷ-- 2k? 


In the linear case, V’(P)= v2P, the solution of (16.8) is sinusoidal in 0 
and we obtain the dispersion relation 


27,2 

n2= cok Ξ- ]-- Ὕ 

oO” — . 
Ww? ωἷ-- νῇ 


(16.9) 


Damping becomes important in the absorption band around the resonant 
frequency vy, and eliminates the singular behavior there. Away from the 
resonant frequency, however, its effect is small and can be neglected in the 
first instance. 

In the near-linear case, we may take 


V'(P)=vP—-aP?+:::, 
P=bcos§+b,cos36+--:-, (16.10) 


E=acos@+a,cos3@+-::-, 


and deduce the dispersion relation 


a (16.11) 


In the fully nonlinear case, (16.8) has oscillatory solutions, as discussed for 
the Klein-Gordon equation (14.3); the close similarity between these two 
cases becomes apparent. 
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The Average Lagrangian. 


A variational principle may be formulated in terms of a potential y, 
the z component of the vector potential. The field components are given 
by ΕΞ - ψΨψ, B= —y,, and a suitable Lagrangian is “ 


L= 5 €0(¥?- cov?) — Nqv,r+N [ + mr? — U(r) 


od Lad Dy. ς. | ftp 
= Σοί(ψ)-- cW2) WP+ 7} v(P)}. (16.12) 


If V’(P) is an odd function of P, it is sufficient for the uniform wavetrain 
to take y and P as periodic functions of 8. Corresponding to (16.7) and 
(16.8), we then have 


Ww? P 
E= = —- ------- -- 16.13 
ωψρ wo? — c2k? Eg ( ) 
22 
l p24. v(P +3 pape 16.14 


where the second equation has been ee and the constant of 
integration A enters as the amplitude parameter. The average Lagrangian 
is then obtained by substituting (16.13)-(16.14) in (16.12) and performing 
the usual manipulations. The result is 


1/2 


wp? 
eats α)- Lal Φ [24-27(P)— ee | ipa) (16.15) 


27,2 
op cok 


Again the similarity with the Klein-Gordon case, (14.26), may be noted. 
If V’(P) is not odd, the more general form 


p= Bx—yt+¥(6) (16.16) 


is required. The parameters 8 and y give nonzero mean values of B and E, 
and they must be treated as a pseudo-wave number and a pseudo- 
frequency, as explained in Section 14.7. The second constant of integra- 
tion, A say, must now be allowed in (16.6) and the triad (y,B,A) is 
analogous to the main triad (w,k,A). In the modulation theory the coup- 
ling of the changes in(w,k,A) with changes in the mean field parameters 
(y,B,A) is a crucially important effect. We shall not pursue the details 
here; they are similar to the case in water waves which will be discussed 
later. 
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General results can be derived from (16.15) as described in the 
previous chapters. However, most of the results in nonlinear optics have 
been found for the near-linear case. It has the advantage in this particular 
context that although a specific model may be used to motivate the theory, 
a broader interpretation of the formulas is fairly clear. The near-linear 
form of & is most easily obtained directly by substituting the expansions 
(16.10) into (16.12), rather than approximating (16.15). The calculation of 
° up to the fourth order in a is particularly simple since the coefficients 
a,,b,, which are third order in a, do not contribute until the sixth order 
terms. [Compare the derivation of (14.52).] To this order, then, we have 


c2k? w? — p2 
et oe eee ape Op eke τῇδ) 
ω 2 deg. 32 Eh, 


The variational equation (ἢ, =0 can be used to determine ὁ in terms of a: 


Eqv? aepr® 
pan ge gs (16.18) 
w?— ve 4 (co? — ν2) 


This result may be resubstituted in the expression for £ to give 
3,6 


c2k? v Qe ny 
[ ἰρ- τς. ᾿ fat + fe at (16.19) 
τ (w* -- v6) 


As a check we note that the dispersion relation ©, =O agrees with (16.11), 
as it should. 

The first term in (16.12), which is also equal to 4€,(E”— c2B?), is the 
basic wave operator for electromagnetic waves in free space. It always 


leads to the term 
272 
| Cok 
4 a Jee 


ω 


in 3, where a is the amplitude of the electric field. The other terms in 
(16.19) incorporate the response of the medium to the oscillating electric 
field. For other models, or to represent known material properties, it seems 
reasonable by analogy to suppose that 


1 cok? 2 2 4 
f= | 1— — > Jega*+ 8,(w) ena” + 24(w)eoa (16.20) 
ω 


for some functions g,(w),g,(w). Then the dispersion relation (Ὁ, =0 allows 
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the identification of g,,g, with the behavior of the refractive index. If the 
dispersion relation is required to be 


Ξ τς Ξ ποί(ω) + 5 πγ(ω)α΄, (16.21) 
then we must choose 
c*k? 
0 = : Ce) ἘΞ 2 |e Ἐπρ(ω)πγ(ω)εραῦ. (16.22) 


The coefficient πρίω) 15 the linear refractive index and more realistic forms 
may now be taken for it in place of (16.9). For example, to include more 
resonant frequencies v,, we have 


}ὔ 
πίω) = 1-92 >) 2 τ hol, 


ore 


where f;= N,/N is the proportion of electrons with resonant frequency ν,. 
This also fits the quantum theoretic description in which the », are 
transition frequencies and the f, are transition probabilities. Similarly, the 
nonlinear coefficient n,(w) can be chosen to represent other models or 
known material properties. However, the overall restriction to cases in 
which quadratic mean fields do not arise must be carefully noted. 


16.2 One Dimensional Modulations 


In the near-linear theory we can proceed simply as in Section 14.2. In 
optics it is more usual to consider the dispersion relation as expressing k or 
n as a function of w, so we work with w and a as the basic variables in the 
modulation theory. The dispersion relation is written* 


k= k(w) + k,(w)a’, (16.23) 


where 


Pais ote ove oe (16.24) 


*We use k,, rather than k, to denote the nonlinear coefficient, since k, will be needed later for 
the x, component of the vector k. 
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To lowest order the modulation equations, corresponding to (14.18)- 
(14.19), are 


da? 0 ; 2 
τι + { ko(w)a \=0. 
(16.25) 
dw dW da’ 
βόλια i Padicects — =(0) 
ox + ko(w) AY + k,(w) or > 
The characteristic velocities are found to be 

Ϊ ; i 1/2 
τ =kolw) + {k,(w)kg(w) } “a. (16.26) 


The equations are hyperbolic if k,kj >0 and elliptic if k,kg <0. For 
(16.11), the sign of kg is the same as νῷ -- οὖ and the sign of k, is the same 
as a. Hence we have 


Hyperbolic:  a(vg—w*)>0, 
(16.27) 
Elliptic: α(νξ-- οὗ <0. 


These results were first obtained by Ostrowski (1967). The normal case for 
optics is w*<vZ,a>0,and the equations are hyperbolic. However, 
Ostrowskii (1968) reports on experiments at radiofrequencies with ferrites 
and semiconductor diodes, where both cases can be obtained. The hyper- 
bolic distortion and elliptic instability are both found, and stable modu- 
lated forms seem to result, in line with the discussion of higher order 
effects given in Section 15.5. 

The higher order effects lead to quadratic terms in the derivatives of a 
and ὁ in the expression (16.17) for (5. The modulation equations are then 
similar in structure to those discussed in Section 15.5. Qualitatively, the 
phenomena are the same and the details will not be given for this case. The 
main results were obtained by Ostrowski (1967), and Small (1972) shows 
how the variational approach may be used. We shall, however, take up in 
the next section the analogous problem of spatial modulations and the 
self-focusing of beams. Higher order effects are important for these and we 
outline the theory. 

The fully nonlinear results corresponding to (16.26)-(16.27) can be 
taken from Section 15.2 using the appropriate Lagrangian. For the simple 
model discussed earlier this is given by (16.15). 
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16.3 Self-Focusing of Beams 


If the nonlinear term in the dispersion relation (16.21) is positive, with 
n(w) >0, then the phase speed c=w/k=c,/n increases as the amplitude 
falls off from the center toward the outskirts of the beam. Intuitively this 
suggests the beam will tend to focus. Of course the argument is only rough 
and we now consider such questions in more detail. 

For spatial modulations, we suppose that locally the wave can be 
described by a periodic wavetrain propagating in the direction of the 
vector wave number k. The average Lagrangian is determined from this, 
and in the simplest cases when pseudo-frequencies are not involved, it 
takes the form £ (w,k,a) where ὦ is the frequency and a is the amplitude of 
the electric field. In the near-linear case, £(w,k,a) is given by (16.22) with 
k =|kj. 

For solutions in which w,k,a are independent of ἡ, we have 
w=constant and the modulation equations deduced from the average 
variational principle are 


e =0, (16.28) 


ak Ok, 
~ £, =0, -- - = =0. (16.29) 


OX; 


If the medium is isotropic so that & depends only on the magnitude k of 
the vector k, (16.29) reduce to 


Pe ee 16.30 
ax, | ;P) =0, ax, ox, (16.30) 

where 
ey Ξ δ, (16.31) 


The dispersion relation (16.28) provides a relation between a and k; hence, 
in principle, p can be taken as a function of k (although it may not always 
be convenient to do the actual elimination of a). 

It is interesting to note that the equations in (16.30) are the same as 
the equations for compressible irrotational steady flow of a gas, with the 
wave number k replacing the fluid velocity vector and p replacing the 
density. The relation of p to k provided by (16.31) corresponds to the 
Bernoulli equation between density and speed in the fluid flow problem 
(see Section 6.17). In this analogy, the beam corresponds to a fluid jet in a 
gas. But care is needed in taking qualitative results directly from fluid flow, 
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because p is usually an increasing function of k in optics, whereas density 
and velocity have the opposite variation in a gas. However, the various 
techniques of finding solutions may be usefully taken over. 

The type of equations (16.30)-(16.31) governs their mathematical 
structure. This is separate from the question of whether the original 
time-dependent equations are elliptic. Moreover, the ellipticity of the 
steady equations does not indicate instability; the type affects only the 
properties of the solution and the form of the boundary conditions. 

For two dimensional or axisymmetric beams, we shall take x in the 
axial direction and r in the transverse or radial direction. The wave 
number vector will have corresponding components (k,,k,), and (16.30)— 
(16.31) become 


iad “τῇ τῆ 16.32 

ae on ee 

EU ee Oa ey a ss aa (16.33) 
Ox I or 2 r ‘ ᾿ 

p=p(k), (16.34) 


where m=0 for the two dimensional beam and m=1 for the axisymmetric 
beam. 


The Type of the Equations. 


The characteristics are most easily found by temporarily reintroducing 
the phase θ, with 


and writing (16.33) as 
k? 9’ 2k,k, p’ k? ρ’ 
Πα αι ΔΗ ΠῚ ΒΓ 
Κρ k ρ Κρ! γ΄ 
Then characteristic curves in the (x,r) plane must satisfy 


ae δ πεν k2 o! 
[τι 5]ω»- A ran (14 BE Nant mo 
p kK p Κρ 


These are imaginary and the equation is elliptic if 


’ 


ici > 0; 
ρ 
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they are real and the equation is hyperbolic if this quantity is negative. 
For the near-linear theory described by the Lagrangian (16.22), we 
have 


2 


E,C 
p=—k'£, =—Sa’ (16.35) 
2w 
and therefore 
Cok l 
aa No(w) + 7 My(w)a* 
wn, (w) 
= πρίω) + — -- (16.36) 
00 


If n,(w) >0, then p’(k)>0 and the steady equations are elliptic. We shall 
consider only this case. We note from (16.26) that the original time- 
dependent equations are hyperbolic if, in addition, kj(w) >0. 


Focusing. 


The “streamlines” determined by the vector field k are the orthogonal 
trajectories of the family of phase surfaces ?=constant. In isotropic media, 
the group velocity has the same direction as k and these streamlines are 
rays. We can picture the phase surfaces moving out along these rays and 
the question of focusing is related to convergence of the rays. To analyze 
this, it is convenient to transform (16.32)-(16.34) and introduce 
coordinates (£,7) based on the rays and phase surfaces. If we introduce 


Κιξξ» k,=&, 6=€(x,r) — ot, 
(16.37) 
pk yr" =,, pk,r™” = — ἢ.» 


the equations (16.32)-(16.33) are satisfied identically; the successive posi- 
tions of a phase surface are given by =constant and the streamlines by 
7 =constant. The relations (16.37) may be written in inverse form as 


᾿ COS X sin x 
— ᾿ ΧΞ-- ᾿ 
ξ Κ n pkr™ 
sin x COS x 
r= 9 = ; 
k ὌΝ 
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where x is the inclination of k to the x axis. Thus the consistency relations 
may be written 


0x pr” 9k ox , afi 
δὲ ΞΞ Ξ θη 9 an τι (sa) (16.38) 


If p decreases away from the axis, then so does k, from (16.36) and the 
assumption that n,>0. Therefore, from the first equation in (16.38), 
dx/0§<0; this shows that the rays bend toward the axis and the beam 
focuses. If n, <0, there is a corresponding defocusing. 


Thin Bears. 


Some interesting solutions of the equations have been constructed by 
Akhmanov et al. (1966) under the further assumption that the beam is thin. 
It is assumed that the nonlinear effects provide a small correction to a 
linear plane wave of constant wave number K and the r derivatives are of 
greater order than x derivatives due to the thinness of the beam. We take 


θ-- —wt+ Kx + Ks(x,r), 


where s, and s, are both small but s, and s? are comparable. [This can be 
done formally by choosing 


ἢ 
= — wt + Kx + Kes( x, |, 


where ε is an amplitude parameter.] With this approximation we have 


k=K(1+s,+ 5397] 


and the near-linear dispersion relation n= cyk /w=n + 4n,a* gives 


1 5. ἀν 2. 
a = 9 go (16.39) 


assuming that cK /w=n)(w). Since px αὖ from (16.35), p may be replaced 


by a” in (16.33). Under the approximation s,<s,, (16.33) may then be 
taken as 


τας ed mM, γα: - 
tS +(5,.+ : s,)a 0. (16.40) 
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Equations 16.39 and 16.40 are to be solved for s and a’. 
Near the axis we may expect 5 to be given by 


it a 


2 R(x) 


where R(x) is the radius of curvature of the phase surface at the axis. 
Surprisingly, there is an exact solution in which s(x,r) has just these two 
terms. From (16.39) we see that αὖ must also be quadratic in r, and (16.40) 
provides relations for the various coefficients. The results are 


s=o(x)+ + O(r*), (16.41) 


2 
Ss ey IY eo 

Para Π εν πε | (16.42) 

fs OO (ὦ a) 
72 ing Fx) RO) fa)” vin 

where 

ἽΕΙ. ΤῊΝ [ a | ee 0)=1: 16.44 
] ~ (m+ i)ngr peers }+ R32 ας Coy 


ro is the initial radius of the beam, a, the initial amplitude on the axis, and 
Ry the initial radius of curvature of the phase surface. 

If the phase surfaces at x=0 are plane (Κρ '=0), the solutions of 
(16.44) are 


1/2 
Ἢ -1- 95). οὐ py) ος Ξια 2 Ι64 
m=0: χ [=] aie (1—f)'?—sin-1f a (16.45) 
1/2 
ai x= (2) To (4 — 2) (16.46) 
Ny Ag 


The beam focuses and the solution becomes singular at the point where 
f(x)->0; at this point the radius of curvature R(x)-—>0 and the amplitude 
a—oo. The distance to the point of focus is 


n 2 
x4 (52 | +, for m=0, (16.47) 


n oer 
x= [8] for m=1. (16.48) 
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In the neighborhood of the singularity, higher derivatives of a become 
important and must be added to (16.39)-(16.40). As we shall see, the extra 
terms introduce dispersive effects which counteract the focusing and allow 
continuous solutions. 

Before doing this, we mention an ingenious solution of the two 
dimensional equations (m=0) that was found by Akhmanov et al. If we 
introduce 


then (16.39)-(16.40) are equivalent to 
v, + ov, — yt, =9, 1 Gee 


τ τ ets = 0, 


where we have replaced γ by y for the two dimensional case. These are 
similar to the equations of unsteady one dimensional gas dynamics, apart 
from the change in sign of y. They may be linearized exactly by the 
hodograph transformation to 


y,— 0X, — yx, =9, 
Vi Ox Tx, ΞΟ. 


Since they are linear, these equations offer the possibility of general 
solutions by superposition. However, in the particular solution referred to 
here, Akhmanov et al. presumably noted the extra convenience of 
variables 


p= xt, g=y— x, 


to write the hodograph equations 


Ὕ 
4,-- -ρ,-ῦ, q, + p, =9, 


but they then retransformed them to 


With v= πὸ ΤΞΦΣ we have 


©,0,,+y0,,=0. 
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This has a separable solution 


O= [μα + tp?)}tanh 4 


which represents a beam. In the original variables the solution is given in 
terms of parameters p and gq by 


2p 4 2 ἼΘΙ Δ" a4 
s,=o0= ——~—tanh , a =1=(4+ 1: Jsech he 
P up 
= --- = — + 
Ce, Oh eed 


In the initial plane x =0, we have 


ἐς P= "ἢ a 
5, =0, α΄ = agsech τ᾿ 
The beam starts with rays parallel to the x axis and has a realistic 
amplitude distribution. It may be shown that the beam focuses at the point 


a 6 ues (16.49) 


Χ,ΞΞΣ eoOoOr = o> = 
f 2γ},2 ἄρ 2n, 


For the present solution to agree with (16.42) near the axis, we must take 
h=r,, and we see that the focal point compares well with (16.47). 


16.4 Higher Order Dispersive Effects 


We return temporarily to the specific model with Lagrangian (16.12) 
to see the effect of higher order terms in the modulation approximation. In 
the near-linear theory the expansions (16.10) are substituted as before, but 
now derivatives of the coefficients a,a;,...,b,b3,..., are retained as ex- 
plained in Section 15.5. For steady beams, these modulation parameters 
are functions of position x only and it is clear from (16.12) that x 
derivatives arise only from the term —cgy? (generalized to more space 
dimensions). On substituting ' 
oe 


3 
sin36+ ..., 


- 4. 
y= sind + 30 


chosen to agree with (16.10), we see that the expression for the average 
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Lagrangian in (16.17) acquires the additional term 


2 


1¢€ 
“4 = 02 


The elimination of ὁ via (16.18) does not involve this term. Therefore it is 
also added to (16.19). Finally, interpreting the form of (16.19) more 
generally as before, we have 


21-2 


1 Cok 
ἐπ: mo) 3 


2 ] 4 Ϊ ECG 2 
€9a7*  - πρ(ω)ηγ(ω)ερα΄ — = —- az. (16.50) 
8 4 w? : 


The variational equations are now 


cok? ce 
δα: (na- "2a Nona’ + —a,,=0, (16.51) 
ww? 


32 
ly gee 
δ: 3 (Ka?) =0. (16.52) 
0k, 9k; 
Consistency: _ rps - ae =(. (16.53) 


The extra term in (16.51) is dispersive in nature and counteracts the 
toc USS: To focus initially, it will be nee, for the nonlinear term 
Non,a°> to dominate the dispersion term CoA, [0 ω΄“. If the beam has initial 
radius rp this gives the estimate 


2 


for the critical strength required. As the beam focuses the reduced trans- 
verse scale increases the effect of a, , and the focusing to a singular point 
is prevented in some cases. The beam might also oscillate in thickness 
under the fluctuating dominance of the nonlinearity and the dispersion. 

As a particular case, we should expect that there is a solution repre- 
senting a uniform beam with all quantities independent of distance along 
it. For a two dimensional or axisymmetric beam, the equations would 
reduce to 


k,=0, k,=k=constant, 


2 21,2 
Co m Cok 
2 3 
3. (a, + Ma) +(m- τ Ja+ Non,a” = 0. 
ω Γ ω 
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If a* denotes the amplitude at which 


eek? 
τ = nbt non,a*?, (16.55) 
G) 
we have 
α,, Ἔ Τα, = 2γ Κ α(α"2-- a?), (16.56) 


where γ5 πχ,2πρ, K*=wng/cé, as before. In the two dimensional case 
m=O, the equation can be integrated further to 


ay = yK7a"(ap— a’), (16.57) 


where ayg=a*V2 is the maximum amplitude; a* is the amplitude at the 
point of inflexion in the profile. The solution is 


a= aysech(y!/*Kay y). (16.58) 


This is the anolog of the solitary wave in unsteady problems. If a constant 
of integration is allowed in (16.57) one can obtain solutions which are 
oscillatory and periodic in y. These are the analogs of cnoidal waves. 

For the axisymmetric beam, m=1, solutions of (16.56) have been 
found numerically by Chiao, Garmire, and Townes (1964) and by Haus 
(1966). The former calculate the solution corresponding to (16.58) and it 
has monotonic decay from ay at r=0 to zero as r—oo. Haus finds 
oscillatory solutions of successively decaying amplitude and these repre- 
sent a beam surrounded by diffraction rings. Small (1972) notes that 
(16.56) in normalized form, with @=a/a*,7=rKa*(2y)'/*, is associated 
with the variational principle 


7 Ι 
5 f (a+ α᾽- 7a*)ar=0, 
0 


and he uses a Rayleigh-Ritz procedure to show that 


@ = 0,8488e ~ 97495" +. 1.3156¢~ 118107 


is a good approximation to the Chiao, Garmire, Townes solution. In these 
dimensionless variables the strength required is 


CO 
— =2- yo rer 
pay a‘r dr = 1.86. 
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Thin Beams. 


The thin beam approximation to (16.51)-(16.52) is obtained under the 
same assumptions that led to (16.39)(16.40). There is now an extra second 
derivative term added to (16.39) and we have 


n 
K?(2s.+s?)a= — Ka) +(a,,+ 7 a,), (16.59) 
0 
bak 4 5 88" + (s +s \a?=0. (16.60) 
0 “ δ τὼ ' 
If we take 
Vv=ae'™, 


the two equations combine into 


0 


2iK ἘΝ v+— - Κ᾿ - 0, (16.61) 
where 
07 moa 
oe m oO 
Vi ar? r or 


This nonlinear Schrodinger equation has a certain canonical structure, in 
the same sense as the Korteweg-deVries equation, and arises in a variety of 
different problems. Surprisingly, a wide class of exact solutions can be 
derived for the two dimensional beam (m=0) by the same method devel- 
oped by Gardner, Greene, Kruskal, and Miura (1967) for the Korteweg- 
deVries equation. This was pointed out by Zakharov and Shabat (1972), 
who go on to give a thorough analysis of the equation. An account will be 
given in Chapter 17. 

By the time the various approximations have been made it is simpler 
to derive (16.61) directly from Maxwell’s equations with an assumed 
nonlinear relation between P and E. For the two dimensional beam, we 
have 


| Oe ieee P= cE, et £,,) 
and add 

P=(n2—l)egE + πρπῃερα Ε 
to give (to a sufficient approximation) 


2 2 πε ae 
Π0Ὲ,, + Nona Ly, — 6( E,, + E,, )- 
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Then if 


E= SY (x,y)eiR te 5 UM(x,y)e Metin, 


we have 


yy" 


n 
— K7}¥P¥= —2iKV,-V,-—Wv 
0 
If V... is neglected, then (16.61) follows. 


16.5 Second Harmonic Generation 


One of the spectacular experiments in nonlinear optics is the produc- 
tion of a blue beam from a red beam on passage through a nonlinear 
crystal. This is a good example of the production of second harmonics due 
to nonlinear effects and the theory is in the spirit of the general discussion 
in Section 15.6. The experiment was first performed by Franken, Hill, 
Peters, and Weinreich (1961). A full account of the theory is given by 
Yariv (1967, Chapter 21). We note the main points briefly. 

In this case the appropriate nonlinear effect is a quadratic dependence 
of P on E, and it is assumed that the components are given by 


P,= (np — leg E, + dy, E,E,. (16.62) 
Ammonium dihydrogen phosphate, for example, exhibits this effect with 
d;, nonzero when i,j, and k are unequal. The anisotropy of the relation 
corresponds to the anisotropy of the crystal. It would be modeled by an 


unsymmetric potential well with a term 


V(P) x P,P, P, 


in the three dimensional version of (16.4). In general, n) depends on the 
frequency w due to the dispersion, but in many cases the d,, are indepen- 
dent of w. 

Maxwell’s equations may be reduced to 


0°E, 
912 


0°P, 
1 = aVE, (16.63) 
Ey Of 
but a little care is needed in using (16.62) directly when a number of 
interacting modes of different frequencies are involved, because of the 
dependence of ng on w. However, if ῥ᾽ is split into two parts P,= P/ + P.”, 
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where P/ refers to the linear part and P.” to the nonlinear, we know that 
for any particular frequency 
1 | 9 Ὲ, 1 OP; )- new? 


912 912 ce 


E,=—kE,, (16.64) 


where k(w) is the corresponding wave number for linear waves. We now 
take a number of interacting plane waves whose y and z components are 
given by 


E,= ἘΣ A(xexp(ik, τω), 
where a=+1,+2,..., 
k__=k,, ω. ,Ξ πω,, AW P= AM 


and k,,w, satisfy the linear dispersion relation. On substitution in (16.63), 
we have 


δ ik dA Ϊ 44:5 ike . 82 PY 
~ l eae 2 ἡ exp(i aX at) = bom 

The nonlinear term P” produces modulations of the amplitudes A™. The 
modulations are assumed to be slow compared with the wavelength and 


the second derivatives with respect to x are neglected. Assuming that 


= ἐκ Ἐκ: 


ijk 
we have 


dA‘ 
Sd) tke oP exp (ik, x — iw,t) 


μ 
--ῷ Σ (cog +0,) dy ASA Mexp { i(kg+ k,) x — Ἰὼ, Ἐὼ.}}}. 
»Y 


(16.65) 


For three interacting waves with frequencies satisfying the resonance 
condition 


W, +, = 03, (16.66) 
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the terms proportional to εἰν give 


dA : ε _ MoCo 
dx 250 


-τ-π ὦ 4545 *exp { i(k; —k,—k,) x}. 
Similarly, 


442 — ioc 
τς = an 24,4} 4. 5 exp { i(k3—k,—k,) x}, 


dA® ἧμς 
i 2 FRG w,d 


ne oR ieAy 41 exp { —i(k3—k,—k,)x}. 


If we assume that A® =0 initially at x =O and that the primary waves 
A“ A® are very little depleted by the interaction, we may take A“, A 
constant in the equation for dA® /dx and obtain 


(1 ΝΕ πρῶ 
Ak 


A) = ἔθ κά Aa Ak=k,— k,— ky. 


ij 
The amplitude is proportional to (sin} xAk)/4Ak. If the interacting waves 
satisfy the resonance condition 


exactly, 4® increases linearly with x at first, but the other interaction 
equations (reducing A,,A, as A, builds up) must then be included. The 
energy oscillates between the interacting modes. Eventually, loss of energy 
to higher harmonics and dissipation of energy must be incorporated. 

In the case of second harmonic generation, the second harmonic is 
produced by self-interaction in which 


W) = W, =, W, = 2). 


In normal circumstances, however, Ak = k(2w)—2k(w)*+0 due to the dis- 
persion, and the second harmonic generation would remain fairly small. To 
impove on this and obtain true resonance, Giordmaine (1962) and Maker 
et al. (1962) proposed the ingenious device of using birefractive crystals 
(described in Section 12.8) to match an ordinary ray at frequency w with 
an extraordinary ray at frequency 2w. The matching condition 


k© (20) -2k©(w) =0 
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No (w) κ᾿. 


Ne (2w) 


optical 
axis 


Fig. 16.1 Index matching of ordinary and extraordinary rays. 


is equivalent to 
(e) - η(0) ΞΞ 
n? (2ω) -- π΄“ (w) =0, 


where subscripts 6 and 0 indicate extraordinary and ordinary rays, respec- 
tively. The variation of n©(2w) and n©(w) with the angle from the optical 
axis is as shown in Fig. 16.1. The vector k as shown gives the required 
direction for resonance. For a ruby laser beam (A =6940 A) in a crystal of 
potassium dihydrogen phosphate the angle is 50.4°. Full details as well as 
alternative possibilities are given by Yariv (1967). 

With these improvements of the resonance condition, the experiments 
gave beautiful confirmation of the theory. A striking photograph taken by 
R. W. Terhune is reproduced in the frontispiece of Yariv’s book. 


WATER WAVES 


16.6 The Average Variational Principle for Stokes Waves 


We now apply the variational approach to some problems in the 
theory of water waves. The variational principle is given in (13.16)-(13.17) 
of Section 13.2, and the approximate developments of Stokes and Korte- 
weg-deVries, backed by subsequent mathematical existence proofs, assure 
the existence of periodic dispersive wavetrains. Since g is a potential and 
only its derivatives appear in the Lagrangian, the most general form for a 
periodic wavetrain is 


p= Bx—yt+ Φ(θ,»). θ-- Κχ — wt, 
ηΞ Ν(ΘῚ, 


(16.68) 


where ®(6,y) and N(@) are periodic functions of 9. The parameter β is the 
mean of the horizontal velocity y,, and y is related to the mean height of 
the waves. In the uniform case, a frame of reference can be chosen in 
which B=0 and the mean height is zero. This was done in the earlier 
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discussion [see (13.120) and (13.121)]. Notice that y#0 even for this 
choice. In the modulation theory, changes in the mean velocity and the 
mean height are coupled with changes in the amplitude. Accordingly 6, y, 
and a related parameter for the mean height must be left open. The 
nonlinear coupling of amplitude modulations with mean velocity and 
height is an important physical effect and the mathematics fits it quite 
naturally. It is the prime example of the situation noted in (14.62) and the 
following discussion of Section 14.7. 

In the lowest order modulation approximation, the average 
Lagrangian is found by substituting the periodic solution (16.68) into the 
expression (13.17). In the first instance we shall consider a horizontal 
bottom and choose the origin y=0 at the bottom so that hy=0. We have 


l 20 
e = τὴ L do, (16.69) 
where 


— Ι 2 1 
ρ[γεωφ,- 5 (B+ k®y)— 5 03— ay | Φ 


᾿ 
oo 


- 4p)w— 4 pgn?+(w-Bk)p [Pod 
2 2 πε 


l 

| 
.«----- 

- 


Ν 
i [ (53+ ΣΦ). (16.70) 
0 


Since exact expressions for ®(8,y) and N(@) are not known, further 
progress is made by adopting either the near-linear expansions of Stokes or 
the long wave theory of Boussinesq and Korteweg-deVries. We pursue the 
Stokes development. The periodic functions ®(0,y) and N(@) are expanded 
as Fourier series in the form 


A 
(0,y) = Σ — cosh nky sin nd (16.71) 
1 
N(0)=h+acos@+ >\a,,cosn6. (16.72) 
2 


The main parameters will ultimately be the triads (w,k,a) and (y,f,h); a is 
the amplitude parameter and h is the mean height of the surface. It may be 
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assumed in advance that the coefficients a,,A, will be O(a”) for small 
amplitudes; (16.71)-(16.72) are substituted in (16.69)-(16.70) to obtain an 
expression for & to any desired order in a. The main interest is in the first 
nonlinear effects, which are of order a* in £&, so it is convenient to 
calculate & up to this order. The coefficients A,,A,,a, appear in the 
expression, in addition to the two main triads, but they may be eliminated 
by solving the variational equations 


e,=0, €,=0, £,=0 


for A,,A,,a, and resubstituting the results in Ὁ. These steps are tedious but 
unavoidable, whatever approach is used. It is, on the whole, a little simpler 
to obtain the relations for A,,A,,a, from the variational principle, rather 
than directly from the equations as outlined in Section 13.13. This route 
also has the advantage that (5 is determined once and for all, and all other 
quantities such as mass, momentum, and energy flux are simply derived 
from it without repetition of similar algebraic manipulations. 
The eventual expression for £ is 


(w~ βκ)" 
pee, 2 = 
is oy 3B?) zoek? + ΣῈ een ΠΝ 
1 k?E? rte | , 
~ a) 22 ΣΎ}; 16.73 
2 08 874 oe ee) 


where 
oe + pga’, T= tanhkh. 


It will be seen below that E is the energy density for linear waves moving 
into still water; it becomes a convenient amplitude parameter in place of a. 
In general, changes of the mean quantities (y,B,h) are coupled to the 
wavemotion and it will be seen that the changes are O(a’). It is therefore 
sedate to replace A by the undisturbed depth hy in the coefficient of the 
term in a* and replace T by 


T)=tanhkh, 


in that term. It is important, however, to keep ἡ in the earlier terms. In the 
original derivation of the expression for © (Whitham, 1967), it was 
supposed that y and 8 were O(a’), since the case of interest there was 
propagation into initially still water. But (16.73) is in fact true without that 
restriction. The extension allows the study of waves on currents, for 
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example, where the changes in B due to the waves will be O(a’), but β 
itself will include the nonzero undisturbed value of the stream velocity. 
There is some slight difference in the form of £ depending on the 
choice of the zero level of the potential energy. In deriving (16.73) from 
(13.17) the zero level was taken at the bottom, assumed to be horizontal. If, 


more generally, the mean surface is y=b and the bottom is y= — Apo, the 
term 4pgh? in (16.73) is replaced by 

Ἢ πος ee ene | 

5 pgb 5 pgho; (16.74) 


the other terms are the same with h=h,+b. This modification must be 
used when the bottom is not horizontal and therefore no longer available 
as the reference level for potential energy. We shall take the bottom to be 
horizontal and use (16.73) unless the contrary is stated specifically. 


16.7 The Modulation Equations 


For a modulated wavetrain the term Bx—yt in (16.68) must be 
replaced by a pseudo-phase (x, ἢ) and y,f defined by 


Y= —V, B=y,, 


just as kx—wt is replaced by the phase @(x,1) (see Section 14.7). The 
average variational principle 


ὃ f { £(w,k, E; y,B,h) dx dt=0 (16.75) 

is to be used for variations in 6£,66,d6h, dy, and we have 
OE: β, =, (16.76) 

. ὥς δ τς, te Oi. 
δθ: > ὯΝ ay Ἐν Υ̓ + 5χ 0, (16.77) 
δ: eC, Ξ0, (16.78) 

᾿ ὃ Ofc 0B oy © 
dy: ar cy Ax CB =(), δ, + ax =0. (16.79) 


The dispersion relation Ὁ, =0 gives 


(ω-- Bk): 9TS—-1072 +9 2 
gktanhkh — 4Té τι} (16.80) 
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in agreement with (13.123). The companion relation ©, =0 gives 


2 
lp 1[1-To ΚΕ 2 
y= 7B +ah+$| T, ; + O(E*). 


Since y= —y,,B=y,, this is a Bernoulli type of equation for the mean flow 
potential ψ, modified by the wave contribution proportional to a”. In such 
relations it seems to be convenient to express coefficients depending on 7, 
in terms of 


Wo(k) = (gk tanhkhy) ’”, Cok) =(gk7! tanh kh,)'””, 


Cpe) ee 
lk) = 5 col k) sinh κῆρ |” 


the linear values for waves moving into still water of depth ho. We have 
en es eco 9 2 
ὙΞ. Ὁ + eh+ 3 ; "τος ). (16.81) 
16.8 Conservation Equations 


The wave action density and flux in (16.77) are 
_ E(w—Bk)_ κα 


a τανε ς PO ΠΕ 2 
Ξε. τ {Ε}} (16.82) 


EB(w— Bk) 1 E(w-Bk) 4 
σον ee a ὦ 2 
* “gktanhkh ” 2 (oxtannkny? ae δὴ OF) 


E 
=~ (B+ Co) + O(E’). (16.83) 
0 
These take the usual form in terms of the energy density Ε. 
Mass Conservation. 


The companion quantities in (16.79) are 


© =ph,  —&,=phB+ = + O(E?), (16.84) 
0 
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We see therefore that the first equation in (16.79) is the equation for 
conservation of mass and that the waves add a net contribution E/c, to 
the mass flow. It is particularly valuable, then, to introduce the mass 
transport velocity defined by 


Ε 


Se ear 


(16.85) 


Energy and Momentum. 


The energy density and flux, defined in (14.74), are found to be 


wh, +L, - δ = 5 phU?+ + pgh?+ E+ O(E), (16.86) 
ρ ρ μὰς ] 2 2Co 1 2 
(16.87) 


The momentum density and flux [see (14.75)] are found to be 


k&,+ BL, =phb+ Ξ =phU+ O(E2), (16.88) 
0 


2, 1 2 2Co 1 2 
—k&, — BL, + & =phU?+ 5 pgh’+| —— — 5 E+O(E?’). (16.89) 
0 


The simplicity of form obtained by introducing U in place of β is to 
be particularly noted. For then the contributions of the mean flow, the 
waves, and their interactions are clearly seen. From (16.86), E is confirmed 
as the energy density contributed by the waves. The wave momentum 
E/co in (16.88) takes the usual form, but the complete term is also 
conveniently written as phU. The expression for the momentum flux 
(16.89) contains thé interesting term 


“-[33- 1}ε (16.90) 


which was first pointed out and exploited by Longuet-Higgins and Stewart 
(1960, 1961), who refer to it as radiation stress. It should then be noted that 
it contributes a rate of working US in the energy flux (16.87); this is a wave 
interaction term in addition to the usual energy flux (U+ Co) E. Whenever 
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a mechanical system is described relative to a frame moving with its center 
of mass, the total energy is 


5 > m(U+e)' = SU? > m+ US) mv + ᾿ Δ mv’, 
the middle term being U times the relative momentum; the three terms in 
- (16.87) are the counterparts of this simpler case. 


From the various expressions obtained above, the original equations 
(16.77)-(16.79) may be written 


0, E 0 Ε dk . dw 
+ pens + ——— = —_ + a Ες 7 1 


dt \ ὡρ Or 
ὃ ὃ Ε 0B doy 
ess peri —— sf = pear —_— = ὡ 2 
Ot es Ox (pnp+ ΕἸ ©, ot ag Ox υ (16.92) 


where w and y are given by (16.80)-(16.81) and terms O(E”) are omitted. 
An alternative set, introducing the energy and momentum equations in 
place of wave action and the consistency equation between B and y, 15 


k, +, =0, (16.93) 
(ph), + (phU),, =0, (16.94) 
(phU),+ (pnu?+ + pgh?+ 5) τὸ (16.95) 
3 hU2+ ρει! ΕἸ + | ohu( 5 U2+ ah) + US+(U+C,)E | =o. 
(16.96) 


In the early work concerning waves on currents there was some 
question about the correct form for a “wave energy” equation for E. One 
way of deducing the correct form is to eliminate A and U as much as 
possible in (16.96) from the preceding equations. It is easily found that 


E,+{(U+C,)E},+ SU, =0 (16.97) 


is correct. The need for the extra term SU, was pointed out by Longuet- 
Higgins and Stewart (1961). Of course (16.97) is equivalent to the wave 
action equation in (16.91), which appears to be more fundamental in the 
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present approach. To see the equivalence we note that the wave action 
equation can be expanded to 


E,+{(B+C))E},+ 


The coefficient in square brackets is equal to (2C,/cy—4) and B= 
U+O(E), so the two agree. 


16.9 Induced Mean Flow 


Equations 16.92, or alternatively the pair ( 16.94)-(16.95), can be 
viewed as determining the changes in ἢ and U induced by the waves. These 
are the long wave equations [see (13.79)] with the additional wave term S. 
We are concerned here with the changes in h and U induced by the 
wavetrain. For waves moving into still water of depth ho, we may suppose 
U and b=h-— hy to be small and linearize the equations to 


b +h U, =0, 
(16.98) 


ale τ. 


For many purposes it is sufficient to take S as a known forcing term 
already determined from the linear dispersive theory for the distributions 
of k and E. Since 


k,+Co(k)k,=0, £,+(C,E),=0 
in that theory, and since S is of the form f(k)E, it follows that 
{ 8(K)S },+ { 8(k) CoS },=0 


for any function g(k). It is then easily verified that a solution of (16.98) is 


ho S 
b=h—-h,= —- ——— : >, 
© ρἤρ-- C2(k) ρῆρ 
ous (16.99) 
(=f Se {Ὁ 


To these may be added the solutions of the homogeneous equations, that 
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is, functions of x+ V ghy t. It is clear from (16.99) that the group velocity 


and the long wave velocity V gh, should not be too close compared with 
a”. But this is required for the validity of the Stokes expansion; in the limit 
C2—>gho, the Korteweg-deVries development is needed. 

In starting up a wavetrain, transient long waves will be propagated 


with velocities + Vgh,, but (again assuming C) and V gh, are well 
enough separated) the mean flow and mean height accompanying the 
wavetrain are given by (16.99). The starting transients set up on moving an 
obstacle through water are discussed in detail by Benjamin (1970). 


16.10 Deep Water 


For deep water, kh )>1, the induced changes in A and B become 
negligible. This should be expected in advance, but it is confirmed exphi- 
citly by (16.99). The average Lagrangian (16.73) becomes 


me l w? l k2E? 3 
ὃν τα (5 ἡ 5 ag OD: (16.100) 


There is no interaction between mean flow and the waves described by 
βιν. As far as the waves are concerned, we may work entirely with Cy it 
fits the simple form of earlier problems, where pseudo-frequencies do not 
arise, and w,k,E are the only wave parameters. The dispersion relation 
from ὃς =0 is 


2 
πράτ KE ...] 
pg 


= gk(1+k’a?+---), (16.101) 


in agreement with earlier results. The modulation equations for E£ and k 
are given by (16.77). 

On a given current U,, the preceding arguments refer to β -- Up rather 
than β itself and, in the limit of deep water, Cw is modified to 


1 k*E? 
———=_ + O(F?). 16. 
γ πε FOCE*). (1602) 


ΒΕΓ ἢ (ω-- οι)" 
tw ὩΣ E- 
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16.11 Stability of Stokes Waves 


For deep water waves, the simple theory of Section 14.2 applies. From 
(16.101), 


wo(k)= (3), (k= (gk)! 


The quantity wow,<0; therefore modulations grow in time. For finite 
depth, the coupling with the induced mean flow becomes important and 
has a stabilizing effect. 

The stability is decided by the type of the full set (16.91)-(16.92), 
which is a fourth order system for k, E,B,h. The type in turn is decided 
from the characteristics. It is straightforward, but lengthy, to find the 
characteristic velocities directly by the standard method. The analysis can 
be simplified and given a more significant form by breaking the argument 
into two parts. First, (16.99) is a sufficient approximation for the relations 
of h,B to E. At the same time, this first step brings out the result that two 


of the characteristic velocities are + V ghy . The expressions for ἡ, may 
be used in the equations (16.91) for k and E to determine the other two 
characteristic velocities. 

With b=h-— hy and B taken to be O(E£), the dispersion relation (16.80) 
may be approximated by 


1 \b . 9T¢-10T24+9 KE 
w= wo +k τας τ": Ξο- οὐ δὲ: 
Wo B 0 2 0 h 873 pCo ( ). 
From (16.99), this reduces to 
w= πω) τῶ) <= + O(E?), (16.103) 


where 


_9TE-10T2+9 1 (2Co~(1/2)eo)” |, 
ghy— C5 | 


Since ὁ,β have been eliminated, we now have the simple modulation 
equations for k,E of the type discussed in Section 14.2 and can read off 
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the result for the characteristic velocities with no further calculation! The 
characteristic velocities are 


(16.104) 


Here wy=( gk tanhkh,)'/? and wy is always negative. Thus the characteris- 
tics are imaginary for 02, >0 and real for 2, <0. The formula for 2, shows 
clearly the stabilizing effect of the mean flow as khy decreases from the 
deep water limit. The critical value for stability is determined by the value 
of khg for which Q2,=0. This value is found numerically to be kh)= 1.36. 
For kh, > 1.36 modulations grow; for khy< 1.36 they propagate in typical 
hyperbolic fashion. 

The instability for deep water waves was first deduced by Benjamin 
(1967) by the Fourier mode analysis described in Section 15.6. This was 
then realized to be the significance of the elliptic modulation equations 
and the critical value khy= 1.36 was deduced for the finite depth case. The 
value was then confirmed by Benjamin using his Fourier mode approach. 
This sequence shows the valuable interplay between the two approaches. 

It should again be remarked that the “unstable case” refers to growth 
of the modulations and not necessarily to chaotic motion. To assess the 
eventual behavior, higher order dispersive terms must be included as 
described in Section 15.5. From that analysis we infer that the next stage 
would be the development of modulations where the envelope of the 
wavetrain is a sequence of solitary waves. 


16.12 Stokes Waves on a Beach 


For a wavetrain approaching a beach, we may take the modulation 
parameters to be independent of ¢. From the modulation equations, we 
then have the four relations 


ω, eit Y; — £, =constants 


to determine k(x), E(x), B(x), h(x) in terms of their original constant 
values out at sea and the depth distribution h,(x). To the lowest approxi- 
mation, the first two relations are 


w= Ww) =( gk tanh kh)’ * = constant, 


E (16.105) 
~f, =—C,=constant, 
Wo 
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which are sufficient to determine the distributions of k(x), E(x) in terms of 
the depth distributions ho(x). Since ὡρ is constant, the relation for E can 
also be interpreted as one of constant energy flux ECo, but it now seems 
that wave action is more fundamantal for such “adiabatic” processes. The 
relations y=constant, — £ g =constant determine the accompanying small 
changes in h—h, and β. The results are 


(16.106) 


[The slight modification noted in (16.74) is used in calculating y.] There is 
a depression of the mean surface and a countercurrent to balance the mass 
flow induced by the waves. 

On a shelving beach the amplitude increases as the depth decreases. 
At sufficiently high amplitudes, which can be variously estimated as 
a/X\=0.142 from Michell’s deep water calculations or a/hy=0.78 from 
McCowan’s solitary wave estimates, the waves peak and the Stokes theory 
ceases to apply. 


16.13 Stokes Waves on a Current 
A similar discussion applies to waves propagating along a nonuniform 


current U,(x), which may be assumed to be due to variations in the depth 
h(x) or fed by upwelling from below. In this case, we have 


w= kU,( x) +W(k) = Κυρί(χ) + { gk tank kho(x) } '/? — constant 
(16.107) 


for the determination of k(x), and 


- δε τῷ (Uolx) + Co(x) } = constant (16.108) 


for the determination of amplitude. For deep water 


1/2 8 
Wy = (gk) / : co=(¥ 5 (Ὁ 5 60 
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we may express the results in the form 


kU,+ (gk) '/? — constant, 
(16.109) 
Eco(2 U + co) = constant. 


These were first found by Longuet-Higgins and Stewart (1961) after 
detailed direct analysis of the equations of motion. For waves moving 
against a current the result is singular, predicting E->oc, when the magni- 
tude of the group velocity equals the stream velocity. At this stage the next 
order terms of order Εὖ become crucial and ensure a finite result. This 
question has been studied by Crapper (1972) and Holliday (1973). 


KORTEWEG-DEVRIES EQUATION 


To conclude this chapter, the modulation theory is developed for the 
Korteweg-deVries equation. There are a number of special features in the 
derivation, and some nontrivial tricks were needed to get exact formulas 
for the characteristic relations. Because of the central position of the 
equation in the subject and the possible tie with further developments in 
the exact analysis of the equation, it seems worthwhile to document these. 


16.14 The Variational Formulation 


It is convenient to choose the particular normalization in which the 
equation becomes 


7, + Onn, + ἡ...» =0. (16.110) 


There is no variational principle for the equation as it stands and 
some potential representation is required. The simplest choice is n= φι; the 
equation becomes 


φ, + OD, Dye + Prxxx =9 (16.111) 
and a suitable Lagrangian is 
aot! quo 
L= ~ 7 PPx~ Px 5 Pxx- (16.112) 


One could also introduce x=9q,, in (16.111), work with a pair of functions 
φ, x, and use a Lagrangian involving only first derivatives. That has the 
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advantage of fitting the general discussion in Section 14.7, but it is simpler 
to work with (16.112) and add a few special manipulations. 

The need for a potential representation is basic to the structure of the 
problem and affects the number of parameters in the uniform wavetrain. 
We must take 


p=~t+O(0), w=Bx-yt, O=kx-ot. 
Then 


n=B+k®,, (16.113) 


and the parameter £ refers to the mean value of ἡ. In terms of ἡ the 
uniform wavetrain solution of (16.110) is given by 


knoee + 67% τὰ τῆν =0, 


and there are two immediate integrals 


ω 


Κ’ηρρ + 3 ἘΞ k 


n+ B=0, 


k7n2 +23 - τη +2Bn-2A =0, (16.114) 


where A,B are constants of integration. For this solution, the Lagrangian 
(16.112) may first be expressed in terms of ἢ as 


. tf, ὦ 1 W233 1 2 
and then, from (16.114), in the equivalent form 


πε ληε B+ 5 - 2) }n—A (16.115) 


We now require the average Lagrangian 


7 l 27 
e = τ 1,40. 
From (16.113), 
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from (16.114), 


l 27 4 _ 1 = kw 
= i: ng dO = = - Pk'ng dn 


where* (16.116) 


U= 
Finally, then, 


€ =kW(4,B,U) + BB+ > By- 5 UB?—A. (16.117) 


The variational equations for the triad (y,8,B) are 


5B: B=—kWs,, 


. ϑ(1ρὴ.. 9 (yg 1,_p)\= 
ss 2 (58) + Ὁ [Ὁ 27 ΒῚ " 
᾿ 0B oy 
Consistency: τ, + re =0. 


From the last two we may take y = UB— B, without loss of generality, so 
we have 


B=-—kWs,, y= —kUW,- B, 
and 
3 (Kw,) + © (kUW,+B)=0. (16.118) 
or BY ax Ξ ' 
For the triad (w,k,A) it is convenient to replace the variational equation 
for 56 by the momentum equation 
9 0 
5, (KEL + BEY) + 9x 6& -κῇ, — BL,)=0. 
We have 
5A: kW, =1, (16.119) 


*The symbol U is now used for the nonlinear phase velocity as opposed to its use for the mass 
flow velocity in Sections 16.9-16.13. 
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Momentum: τ (kWy)+ το (kUWy-A) =0, (16.120) 


Consistency: CL ὃ (Ku) =0, w=ku. (16.121) 
ot Ox 
Equations 16.118, 16.120, and 16.121 may be viewed as three equations for 
A, B,U, with k given by (16.119). A more symmetric equivalent form is 


OW; as 08 


a ee hee 

IW, OW, ΘΑ 

5 + U— -ν, 4 =0, (16.122) 
OW, OW, dU 

i Oe aoe 


In terms of these variables, the wave number, frequency, and mean value 
ἢ = B are given by 


W 
Og. eee (16.123) 
A 


The amplitude a is obtained by relating the zeros of the cubic in W to the 
coefficients A,B, U. The natural choice of basic parameters in the physical 
description would be k,f,a; the trio A,B,U is an equivalent set. The 
dispersion relation w=w(k,8,a) is provided implicitly by the second ex- 
pression in (16.123). 

The equations in (16.122) have a reasonably symmetric appearance, 
whereas the original variational equations in B and y look awkward. This 
seems to be associated with the hybrid nature of the Korteweg-deVries 
equation as an approximation to the original water waves formulation. In 
deriving the approximation, the fluid velocity is expressed in terms of the 
depth [see (13.102)], so that the triad y,8,B becomes intermingled in an 
unsymmetrical way. For instance, β is introduced as the mean height in 
(16.113), yet its natural role is that of the mean fluid velocity. This duality 
is smoothed out in the more symmetric form (16.122). A related point is 
that because of the potential representation, we have to work at first with a 
fourth order system; the more balanced form is recovered only when we 
revert to a third order system. 
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16.15 The Characteristic Equations 


The system (16.122) is hyperbolic in general and we now consider the 
characteristic equations. The function W and its derivatives W,,W,,Wy 
can all be expressed in terms of complete elliptic integrals and the 
characteristic form of the equations could be ground out directly, but with 
considerable labor. Surprisingly, however, if the zeros p,q,r of the cubic 


1-5 Un?-+ Βη- A=0 (16.124) 


are used as new variables in place of A,B,U, and if various (nontrivial) 
identities among the second derivatives of W are introduced, the equations 
may be put in a simple form from which the characteristic relations and 
velocities are seen immediately. It turns out that the equations may be 
written 


(qtr),+ P(q+7),=9, 


p(W,- W,)+ q(W, - W,)+r(W,— W, 


) 
q ἢ 
Ρ- +q+r)- 16.125 


plus similar equations for r+p and p+q in cyclic permutations. Thus the 
Riemann invariants are 


qtr, rtp, » τα, (16.126) 


simply, and the corresponding characteristic velocities P,Q,R are the 
coefficient in (16.125) and its permutations. 

At this point it is useful to express the quantities concerned in terms of 
elliptic integrals. We introduce 


got, eee. Sager (16.127) 


a is an amplitude variable and s is the modulus of the elliptic integrals. 
Then it may be shown that 


Ds) 
a ae δ: 


where D(s),K(s) are the complete elliptic integrals in standard notation 
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(Jahnke and Emde, 1945). If we rather use B,a,s as basic variables, we 
have 


= OE aa a zs »ι 
p=B+2a—, q=B+2a( 2 1), r p+20(2 5} (16.128) 
The wave number and phase velocity are given by 

ΠῚ πα" | 
ae ie ae (16.129) 

2 
U=2=2(pt+qtr)=6p+4a( 22-18) (46.130) 

k K e 


The Riemann invariants and characteristic velocities are as follows: 
Riemann Invariant Characteristic Velocity 


qtr P=U- —— 


r+ = U— —____ 
ἥ s*(K—D) 


4α(1] -- 5 Κ 


+ i ωπαακρικιις τς τς τος 
ΕΝ s*(s*D -- K) 


In general the velocities P,Q,R are distinct and P<Q<R. Thus the 
system is hyperbolic. The limits s7->0 and s?-s1 are both singular in that 
two of the velocities become equal. The limiting equations are then not 
strictly hyperbolic, although, due to the uncoupling of one of the equa- 
tions, they may still be solved by integration along characteristics. This is 
the situation encountered earlier in the linear theory, to which the limit 
s*-»0 corresponds. 


Small Amplitude Case. 
If we take a->0,5*-0 but keep the wave number k given by (16.129) 
finite and nonzero, we have 


4q'/2 
So ζ ς 


In the extreme limit 52-»0, we find 


P,0>68—-3k2, R368. 
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The linear theory would neglect changes in β and we would have the linear 
group velocity —3k? as a double characteristic. In the next order correc- 
tion, the near-linear theory, we find 


2 
P~6B—3k?—3a+ (5). 
2 
O~6B —3k?+3a+ o{ $ (16.131) 
az 


The corresponding approximation in the original equations gives the set 


p,+6p8,+| 54°] το, 


k 3 3 a’ es 
,+|6Bk—kKe+] 5 |] =0, (16.132) 


(a*), + {(68 —3k?)a?\ +6a*B,=0. 


The terms in square brackets are the near-linear corrections to linear 
theory. In the linear theory, the equation for B uncouples and may be 
solved independently; it provides the characteristic velocity R=6. Usu- 
ally, however, the solution B=O is appropriate and we have the usual 
modulation equations for a and k. The near-linear corrections introduce 
the important qualitative changes that make the system genuinely hyper- 
bolic and split the remaining group velocities. The modification of the 
equation for B is crucially important. If only the correction to the 
frequency (in the second equation) were introduced, the pair of equations 
would have imaginary characteristics and would appear to present a case 
of instability. In the terminology of Section 14.2, we have wy=—k’, 
W,=3/2k, wow,<0. But the coupling with β stabilizes the modulations 
and the full system is hyperbolic. We can simply calculate the characteris- 
tics of (16.132) and check (16.131), but it is again instructive to take the 
approach used in Section 16.11. If B is entirely induced by the wave 
motion, we may use the second and third equations in (16.132) to show 
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that (αὖ), =(a*/3k?),, to lowest order. Then from the first equation 


PO oe 


After this is substituted in the second equation the effective change in 
frequency is 


The equations for a and k are now hyperbolic with characteristic velocities 
—3k*+3a in agreement with (16.131). 
16.16 A Train of Solitary Waves 


In the other limit s*->1, the wavetrain becomes a sequence of near- 
solitary waves. In this case, K and D are asymptotically given by 


K=A+O(1-s’), D=A-—O(l1-s’), <A=log 
(1 -- 


In the solitary wave limit it is natural to take the amplitude to be the height 
of the crests above the troughs and to take the wave number to apply to 
the number per unit length (rather than number per 277). Accordingly we 
introduce 


a,=2a, Κιξ ς---. 


Then from (16.129) we have 


1/2 
= (2a)) : 


er Vs (16.133) 


Errors of order 1 -- 52 are exponentially small in (2a,)'!/2/k, and we work 
to that order. Then 


p~B+a,—-2k,2a,)'"; — q,r~B—2k,(2a,)'"; (16.134) 


Sec 16.16 A TRAIN OF SOLITARY WAVES 573 


U~6B +2a,-12k,(2a,)'”’; (16.135) 


> 


i=36@a) 
Ρ--6β-- 16k,(2a,)’ | | 


1—4k,(2a,) 1”? 


O,R~6B +2a,—12k,(2a,)'”” (16.136) 
We note that 


- 4,1 ~a,\/ 
q=r+O (exp k ,. O=R+O exp k : 
Ι i 


In the near-linear limit changes in 8 propagate primarily on the fastest 
characteristic (velocity R), whereas a and k are carried primarily by the 
two slower ones. By contrast, in this limit B is carried primarily by the 
slowest characteristic (P), and a, and k, propagate out ahead. In the 
forward region we may integrate along the P characteristic and deduce 
that g+r remains equal to its initial value. But in this limit g=r 
throughout so that g and r individually remain equal to the initial value. 
The normalization would usually be such that g=r=0; hence they remain 
zero. Then from (16.134) we have 


β--2κ,(2α,}" (16.137) 
and 


p~a,; q,r~0; U~2a;; O,R~2a,. 


The corresponding approximate equations can be shown to be 


k,,+(2a,k,), =0, 


(16.138) 
a,,+2a,a,,=9. 


In this approximation, the system is not strictly hyperbolic, but a, may be 
found first by integration along the characteristics dx /dt=2a,, and then k, 
can be found by integration along the same characteristics. The structure is 
similar to the one found in the linear theory. However, this time it is a, 
that remains constant on the characteristics and k, decreases like 1/t. 

As in the case of the linear limit, the next order approximation beyond 
(16.137)(16.138) modifies the structure of the equations; the characteris- 
tics are separated and the system becomes truly hyperbolic. 
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The equations in (16.138) are so simple by now that one would assume 
there is some direct derivation without going through the general case first, 


and this is indeed true. The Korteweg-deVries equation may be written in 
conservation form as 


η,-Ὁ (307+ η,.},5Ξ0, (16.139) 


and if it is averaged over a number of waves we have 


(7), + (3?) =0. (16.140) 


Now a solitary wave with g=r=0 and p=4, is given by 


ay 
n=a,sect? | ($1) 


If the average values are calculated from 


1/2 


x42) ἡ. (16.141) 


7= ki fon dx, n= kif παν, 


using this solution, we have 


een ne — _ 16k, (a,\°”” 
= (3) i (J) 
Equation 16.140 becomes 
(k,a}/?), +(2k,a}/*), =0. (16.142) 


From (16.141), the phase velocity U=2a,; therefore the consistency equa- 
tion k,,+(k,U),=0 becomes 


Κι, +(2a,k,),=0. (16.143) 


This pair for k, and a, is equivalent to the pair in (16.138). Notice that in 
this derivation it is implicitly assumed that g and r remain equal to zero in 
the modulation. 

An important particular solution of the equations is 


x -.1 fx 
a= 3 = 3-3) (16.144) 
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where f is an arbitrary function. This is easy to interpret. A solitary wave 
of amplitude a, moves with velocity 2a,. Therefore (16.144) represents a 
sequence of solitary waves each retaining a constant amplitude and mov- 
ing on the path x=2a,t. The decrease in k, is due to the divergence of 
solitary waves of different amplitude. The solution is represented in Fig. 
17.1, where it is deduced from the discussion of exact solutions. However, 
it is completed there by a discontinuity in a, and k,. We therefore consider 
what the jump conditions are for our equations. 

There is the usual question of which conservation equations should be 
maintained across the discontinuity. If we accept (16.142)-(16.143), the 
jump conditions are 


—V[k,al/?] + [2k,a}/?] ΞΟ, 


~V[k,]+[2a,k,] =0, 


where V is the velocity of the discontinuity. A jump from a,=0 to a 
nonzero value a would therefore have V=2a{. This is the phase 
velocity and the result indicates that the solution (16.144) may be cut off at 
any one of the solitary waves in the sequence. It is this choice that is 
confirmed by the exact solution in Section 17.5. The function f and the 
amplitude a can be determined only from the initial conditions, which 
are provided in Section 17.5. Of course the exact analysis is superior to the 
modulation theory of solitary waves in the case of the Korteweg-deVries 
equation. But the confirmation of the results in this case substantiates 
similar uses of the modulation theory in problems where exact solutions 
are not known. 

It might be noted finally that if the solitary wave solution is written 


and it is used to calculate an average Lagrangian defined by 
foe) 
ἢ Ξε κ᾽ { Ldx, 
we find 
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The variational equations are 


δα: &, =2a,k,, 


0: (a}/?),+( 207) =0, 


x 


Consistency: k,,+w,,=0. 


These are equivalent to (16.142)-(16.143). 


CHAPTER 17 


Exact Solutions; Interacting Solitary Waves 


17.1 Canonical Equations 


One of the most remarkable developments in recent work on non- 
linear dispersive waves is the discovery of a variety of explicit exact 
solutions for some of the simple canonical equations of the subject. The 
main equations concerned are the following: 


1. The Korteweg-deVries equation, now normalized to 
ῃ, + σηη, + Nexx = 9; (17.1) 


where o is a constant, 
2. The cubic Schrédinger equation 


iu, +u,, +v|ulru=0, (17.2) 
3. The Sine-Gordon equation 
Dit — Py, + Sing = 0. (17.3) 


Explicit solutions representing the interaction of any number of solitary 
waves can be constructed, and a precise prediction can be made of the 
number of solitary waves that will eventually emerge from any finite initial 
disturbance. 

These equations are canonical for the subject in that they combine 
some of the simplest types of dispersion with the simplest types of 
nonlinearity. The Korteweg-deVries equation combines linear dispersion 


ωΞ -- κ' (17.4) 


with a typical nonlinear convection operator. Equation 17.2 combines 
dispersion represented by 


w= K? (17.5) 
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with a simple cubic nonlinearity. In both cases, the dispersion relation can 
be viewed as a Taylor series approximation to a more general dispersion 
relation, (17.4) referring to those odd in x and (17.5) to those even in x. [A 
term proportional to « added to (17.4) or (17.5) can be normalized out by 
choosing a moving frame of reference.] For this reason the equations are 
not mere models but frequently can be derived as valid approximations for 
long waves. The linear dispersion relation w?=x*+1 in (17.3) is still a 
fairly obvious general form, apart from its original relevance to relativistic 
particles in the Klein-Gordon equation; a number of problems in which. 
the appropriate nonlinear term is sing were noted in Section 14.1. 

Ever since the Cole-Hopf solution of Burgers’ equation, countless 
people must have tried similar tricks to solve (17.1), but the eventual 
method of solution requires much more than a simple trick. Gardner, 
Greene, Kruskal, and Miura (1967) developed an ingenious series of steps to 
tie the equation to an inverse scattering problem. The end result is the 
transformation of (17.1) to a linear integral equation, but it seems incon- 
ceivable that anyone would discover it without intermediate steps. By 
hindsight, one can see that the substitution 


on = 12(logF) x. (17.6) 
which is a reasonable generalization of the substitution 
c= —2p(logg), 


for Burgers’ equation, offers an easy route to the special solutions repre- 
senting the interaction of solitary waves. The equation for F is not linear 
but it has some special structure, and solutions in series of exponentials 
lead to solitary waves. However, it is not clear how to extract more general 
information from the equation for F. 

Equation 17.2 was solved by Zakharov and Shabat (1971) by a similar 
inverse scattering technique, relying to some extent on the general ideas of 
Lax (1968). 

The explicit solution of (17.3) for two interacting solitary waves was 
first noticed by Perring and Skyrme (1962), apparently on the basis of their 
numerical computations. Lamb (1967,1971) subsequently showed how a 
Backlund transformation could be used consistently to produce further 
solutions. Recently Lamb (1973) and Ablowitz et al. (1973) have shown 
how inverse scattering theory can be used. 

The surprising overall result is that if a number of initially well-spaced 
solitary waves are allowed to interact, they will eventually emerge from the 
interaction and recover their original shapes and velocities. The only relic 
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of their interaction is a constant displacement from the positions they 
would otherwise have had. The analogy with the collision of particles is 
intriguing. The main solutions and methods of derivation will be described 
in this chapter. | 

We add two related topics. Toda (1967a, 1967b) considers mass-spring 
chains which are discrete versions of some of our problems. If the exten- 
sion of the nth spring from its equilibrium length is 7,(¢), the equations 
may be written 


mi, = 2f(r,.)—Lta+) ~fUa—v> (17.7) 


where f(r) is the force law for each spring. The continuous limit of this 
difference equation would be a wave equation, nonlinear if f(r) is a 
nonlinear function of r. In the nonlinear case the existence of uniform 
wavetrain solutions of (17.7) can be shown on the basis of a Stokes type of 
expansion in powers of amplitude. But Toda found ingenious exact expres- 
sions in elliptic functions for the case 


f(r)=—-a(l-—e-*”). (17.8) 


Moreover, these solutions have solitary waves as limiting cases and Toda 


was able to find solutions representing interactions with behavior similar to 
that of the continuous ones. 
Finally, the equation 


(1 -- 7) 0. +29, 0: Pe — (14 92) Pn =0 (17.9) 


was proposed by Born-Infeld (1934) via the variational principle 
Bf [{{-- φ τ 92} axar=o, (17.10) 


The idea was to extend the simple wave equation (with Lagrangian 
} g?—4 m2) and introduce nonlinear effects while preserving the Lorentz 
invariance properties. This equation admits waves of arbitrary shape 
moving with velocities of +1 or —1. They may be chosen as “solitary 
waves” but lack the specific intrinsic structure of the previous cases. 
However, Barbishov and Chernikov (1967) have shown that explicit solu- 
tions for interactions can be found, again with shape-preserving properties 
and a shift in position due to the interaction. A brief description 1s added 
even though there may be no deep relation with the others and they are 
not dispersive waves. 
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KORTEWEG-DEVRIES EQUATION 


17.2 Interacting Solitary Waves 


We first give the solutions derivable from the transformation (17.6). 
The parameter o can be normalized out of both (17.1) and (17.6), of 
course, but different choices corresponding to o=1,6, —6 have all been 
used in the literature and it is convenient to leave it open here for ease in 
making cross-references. The derivation of (17.1) for water waves and its 
more general significance through (17.4) as an approximation for long 
waves in other contexts have been explained in Section 13.11. 

The transformation from 7 to F is most easily made in two stages. 
First ἡ =p, is introduced and the equation integrated to 


] 
Pit Ps + Paxx Ξ0; 
then the nonlinear transformation 
op = 12(logF), 


is made. Terms up to the fourth degree in F and its derivatives arise, but 
the special feature of the transformation is that the terms of third and 
fourth degree cancel. The result is the quadratic equation 


ri es a sheer ee oe iy ee +3( Fe. ~ Fy Fix) = 0. (17.11) 


One notes the appearance of the basic operator 


0 93 
Or 3x2 


and a certain balance to the equation. A possible (but rather tenuous) 
motivation for the transformation starts from the solution for a single 
solitary wave, which may be written 


θ--θ 
on = 3a7sech’>——, θ-- αχ -- at, (17.12) 


where a and θ0 are parameters. This is the x derivative of 


6-8, 
6a tanh 5 7-1|} 
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which is in turn the x derivative of 12log F where 


F=1+exp{—(9—4))} 
θ 
=1+exp{—a(x—s)+a't}, c=. (17.13) 


This “derivation” focuses on one working rule for finding exact solutions 
in this area: to consider transformations which make the special solitary 
wave solutions appear as simple exponentials. One might note for com- 
parison that the transformation c= —2p(logg), for Burgers’ equation puts 
the steady shock solution (4.23) into the form 


p =exp(— a,x +vart)texp(—a,x+vazt), αἱ oe . (17.14) 


Whatever motivation is used, we see immediately that (17.13) 1s a 
solution of (17.11) for any @ and 5. It is a solution corresponding to the 
operator 


it satisfies F,+ F,,,=0, and the third pair of terms in (17.1 1) vanishes due 
to the homogeneity in the derivatives. 

If (17.11) were linear, we could superpose solutions with different a 
and s, but due to the nonlinearity there will be interaction terms. A normal 
interaction approach would be to take 


F=1+ FO+ FO+::- 
with the hierachy 


{FP + Fo}, =0 
t XXX J x ’ 
ἜΡΩΣ FO) = ~3{ FO” FO FO. 
t XXX Sx xXx x XxX. $ 
and so on, to solve. Let us take two terms like (17.13) for F™: 


FY=f+f, f=exp{—a(x—s) tat}, j=l,2. (17.15) 


The equation for F™ is 


2 
ID Dade eee em 3a a(a,— a) fi hr (17.16) 
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and its solution is 


FO = ef (17.17) 
2 l 


The surprising thing is that all the remaining equations in the hierarchy 
then have zero on the right hand side, so that 


FHl+f,t+f,+ — ae (17.18) 


is an exact solution of (17.11). 


The significant point in this solution is that the interaction terms 
produce ony the product term ἢ f, on the right of (17.16) and not terms in 
f? and f?, which could also be expected. This result generalizes to higher 
orders and the nonlinear terms in the equation never produce products of 
f’s containing a repeated subscript. Thus with an input of only two terms 
as in (17.15) the only needed combinations are f,,f,,f, f,, and we have an 
exact solution. If we start with 


N 
FOS 4, 


j=1 


then F contains all terms ff, with j#k but not f?; F® contains all terms 
Sid, with AKA! but not [ἢ or f7f,; and so on. Thus the sequence 
terminates at 


POO CTs: + fy 


(having covered all products without repeated f’s), and there is an exact 
solution in the form 


fal Pa > Gb 2 Sythe + tt Fay. whi Sy’ Sy: 
J JEKF 


i#k 


As if this were not amazing enough, it can also be shown that the solution 
may be written 


F=det|F 


ail? 


(17.19) 


Sec 17.2 INTERACTING SOLITARY WAVES 583 


where* 


Ε. -ὃ αν, 17.20 
= Ἐπ aa (17.20) 


This result was first found by the more general approach referred to above 
and described in the next section, but it may be verified directly in (17.11) 
(see Hirota, 1971). 

With N modes f,, the solution represents the interaction of N solitary 
waves. We discuss the case N =2. The solution for F is given in (17.18) and 
the corresponding expression for ἢ given by (17.6) is 


σ΄ ot? f, + 003 fy Ἐ2(α5-- 044) fy fy + ((α5-- αὐῬΔία;- 04)} (αξ ἢ 2+ at fi fz) 


ve (1+ fi+fot ((a2— a1)/(a2+ an) fife) 
(17.21) 
where 
f, =exp { —a,(x—5;) +art}. 
A single solitary wave (17.12) may be written in terms of f as 
Die π᾿ (17.22) 
I 5. 
where the maximum of on occurs for f=1. We note that 
Maximum amplitude of on =3a”, 
Position of maximum = 5 + at, (17.23) 


Velocity of the wave= a’. 


The solution (17.21) is approximately a solitary wave with parameter a, for 
regions of the (x,t) plane where f,=1 and f, 15 either large or small. To 


*There are a number of equivalent forms for the F,,,, which lead to the same final expression 
for 7. 
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verify this, observe the following: 
Ι. Forfi=l, <1, 


2 
ost 


N . 
2. (1+f,)° 
2. Forfi=l, f>1, 


oo { (02 a1)/ (a+ α")}᾽ αἱ Π9} 
- (fot {(0,-0,)/(a, +0) }7f,) 


κ 2 

ath mf Ao αι 

= 2 32°? i= a + fi. 
(1+/f,) 2: αἱ 


The latter is the solitary wave a, with s, replaced by 
2 
A, + a 
3,=5,- Δ tog{ 22": (17.24) 


this represents a finite displacement of the profile in the x direction. 
Similarly, where f,~1 and ἢ is either large or small, we have the solitary 
wave a, with or without a shift in s,. Where f,=1 and f,=1, we have the 
interaction region; where f, and f, are both small or both large we have 
on = 0. 

The behavior of the interacting solitary waves described by (17.21) 
can now be seen. We take a,>a,>0 for definiteness and note from 
(17.23) that the solitary wave a, is stronger and moves faster than the wave 
a,. As ἔτ τοῦ, there is no interaction region in which f,~1, f,~1, and 
(17.21) describes 


Solitary wave a, onx=s,+a7t, f,~l, f,<], 


2 

] ΑΔ αἱ 2 

Solitary wave a, on x= 5,-- — ἰορ᾽ --Ξ----  Ἑαδ, f,>1, ξ55Ξ|; 
a Ol, ~ αὶ 


elsewhere on ~0 (ἢ and f, both large or small). 
This represents a larger solitary wave a, overtaking a smaller one a). 
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As t+ 00, we have: 


2 
. | Ay αἱ 2 
Solitary wave a, on x=s,— —log] ——— | τα, f,=1, f,>1, 
6] 5 8] 


Solitary wave a, on x=s,+a3t, 41, f=1; 


elsewhere on = 0. 

The remarkable result is that the solitary waves emerge unchanged in 
form with the original parameters a, and a,, the faster wave a, now being 
ahead. The only remnant of the collision process is a forward shift 


2 
Ay + αἱ 
— lo Gena for the wave a,, 


2 
l A, + αἱ 
ἘΞ log( 2] for the wave a,. 


The interaction occurs in the neighborhood of 


2 2 
ΓΝ 52. 5] χα δι 12 
ν DA 2 " 2 Ὁ 


In this region f,~ 1, f,= 1, and (17.21) describes how the two peaks merge 
into a single peak and then reemerge in the reversed order. 

Similar results can be inferred from (17.19)-(17.20) for the case of N 
waves. The eventual behavior as t—»0o consists of N solitary waves lined 
up in order with strength and velocity increasing toward the front and 
separating further as ¢ increases. 


17.3 Inverse Scattering Theory 


In setting up the theory, we temporarily let 


u=— ra (17.25) 
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to agree with the original papers; the equation is then 


u,—6uu, + u,,, =9. (17.26) 


From the analogy with Burgers’ equation, the substitution 


dex 


uc 


is a popular first attempt toward finding solutions, but this alone does not 
lead far. Gardner, Greene, Kruskal, and Miura (1967) took it further with 
the choice 


um ΞΘ ἐλ 


and noted that rewritten this is the reduced Schrodinger equation 
W,.+t(A— uy =0. (17.27) 


At this point, presumably the emphasis changed and (17.27) was con- 
sidered not as a transformation that would produce a simpler equation for 
ψ, but rather as an associated scattering problem from which information 
on Ψ could be used to diagnose properties of u. In this view, the wave 
profile u(x,t) provides the scattering potential. The time ¢ appears as a 
parameter; there is a different scattering problem for each ¢. This time 
parameter is completely separate from the time 7 that might have been 
eliminated in reducing the wave equation 


Dx — Pz, - U(x, Dp =0 (17.28) 


to (17.27), via p(x,7,t) = W(x, De'’*". We shall return to (17.28) later, but 
for the present we adopt the reduced version (17.27). 

To use (17.27), we have to find the equation for ψ from (17.26). Since 
values of A would belong to the spectrum of the scattering problem (17.27) 
and the problem changes with ἐ, it is appropriate in the first instance to let 
λ be a function of t. When (17.27) is substituted in (17.26), it is found after 
a certain amount of ingenuity that the equation may be taken in the form 


y+ (0,-4,0}=0, (17.29) 


O=y,+,..—3(utrp,. (17.30) 
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We now restrict the discussion to solutions of (17.26) with u—0 as |x| 00 
and u integrable. Under this condition the spectrum of (17.27) is discrete in 
\<0O and continuous in A>O. For the point eigenvalues, A= —x?, the 
corresponding eigenfunctions Ψψ,, satisfy 


lv,|29, |x|, 


oo e 
f ψ2αχ =finite >0. 


Therefore integrating (17.29) from -- οο to oo, we deduce that A, 15 
independent of ¢. For the continuous spectrum we may choose a A>0 
independent of ¢ and consider the behavior of the corresponding solutions 
Ψ with ¢. In either case, we deduce from (17.29), with dA/dt=0, that 


Q =U, F Yee — 3(μ +A, = CY, (17.31) 


where C is independent of x. We now know that any solution y of (17.27) 
for fixed A will develop in time according to (17.31). In analyziny the 
eigenvalue problem (17.27), it is convenient to introduce A= μ and to 
work with a function x which satisfies (17.27) and the condition 


x~e'™, §u>0, aS x>+0. (17.32) 


In order that this function, introduced for t=0 say, should retain the same 
normalization (17.32) as it develops in time, we require (17.32) to be an 
asymptotic solution of (17.31) for all ¢. This requires the choice C= —4ip? 
and our pair of equations becomes 


Way + (pw? —u) p=, (17.33) 
Wet Were —3(ut p?) y+ 4ip*y =0. (17.34) 


We now have to explain how this curious formulation allows u(x,t) to be 
calculated. 

The method depends on the result that the scattering potential μ in 
(17.33) can be constructed from knowledge of the reflection coefficient for 
waves incident from x= + 00, together with certain information about the 
point spectrum. This is the inverse scattering problem, the original problem 
being to determine an unknown scatterer from its reflection properties. In 
the present context the required information on solutions Ψ is determined 
not from experiment but from the second equation (17.34). To be specific, 
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we consider the problem of finding u(x,7),t>0, given u(x,0). The proce- 
dure is as follows. For the given u(x,0), we first solve the eigenvalue 
problem (17.33), and determine the point eigenvalues w= ik,, the corre- 
sponding eigenfunctions y,, and the reflection coefficient 8 for incoming 
waves. One choice for the eigenfunctions is 


ψ,(χ)Ξ x(ik,, x), 


where x is specified by (17.32); the normalization constants are then given 
by 


na { [> var) 


As regards the reflection coefficient 8, we determine the solution Ψ of 
(17.33) with u(x,0), which has the following behavior at + oo: 


— ikx ikx 
γάρ) + B(k)e"*, x>+0, 


a(k)e~**, x—>— 0, 


for k real and positive. This determines the reflection coefficient B(k) and 
the transmission coefficient a(k). This is the direct scattering problem: 
finding k,,y,,8(k) for the given u(x,0). The inverse problem would be to 
determine u(x,0) from the knowledge of x,, y,, B(x). 

We now turn to the development of these solutions in time. We know 
that the x, are unchanged. From (17.34) and (17.33), 


ΡΞ 00 00 
“ἢ x? dx εἰ -χχι, τ 4xZ+ wx?) -ϑιμ᾽ [ χΡαν. 
ΞΟ — 0 
For the eigenfunctions, w=ik,, and ~,(x,)=x(x,t,ik,)-0 as x>+00; 
therefore the normalization constants are 
—f 


ο(ἡ-| [wa = 7,08", 


The solution V(k, x,t) for scattered waves will have some behavior 
W(k,x,t)~f(k, the" + 2(k,te™, x00, 


but this must be an asymptotic solution of (17.34) with »=k. On substitu- 
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tion we deduce that 


(κι πες, g(k,t) =B. 
The reflection coefficient is 


k, ᾿ 
b(k,t) = — = B(k)e**, 


The inverse scattering theory provides the construction of u(x, 1) from 


Κ,» c,(t), b(k,t). 


n 


In summary then, the direct scattering problem for u(x,0) determines 
k,»C,(0),b(k,0); (17.34) provides their development in time; the inverse 
problem determines u(x,t) from these quantities. 

Of course a major input is now the solution of the inverse problem. 
This is provided by the famous Gelfand-Levitan paper (1951) and its 
various extensions. Their paper is phrased in terms of determining the 
scattering potential u from the spectral function p(A), which has jumps of 
magnitude c, at the point eigenvalues \= — x? and a continuous spectrum 
O0<A<oo related to b. Kay and Moses (1956) and Marchenko (1955) 
provided direct constructions from k,,c,,b; a thorough review is given by 
Faddeyev (1959). The result is that 


.. χα 
u(x,t)= 2: K(x, x,t), (17.35) 
where the function K(x,y,f) satisfies the linear integral equation 
K(x,y,t)+ Blx+y,t)+ [ K(x,z,t)B(zt+y,t)dz=0,  y>x, (17.36) 
in which 
B(xt+y,1)= Σὲ c,(dexp{ —«,(x+y)} 
ae f ” b(k, dexp{ik(x +y)} dk 
ps a ed 
= >i 7,exp{ — κι(χ +y) + 827} 
ΓΙ -- [ Blbexp{ik(x+y) + Bik} dk; (17.37) 


the initial function u(x,0) provides the appropriate κ,, y,,8(k). 
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The derivation of (17.35)-(17.37) from the spectral approach requires 
more extensive discussion than would be appropriate here. But Balanis 
(1972) has shown that, formally at least, the results can be obtained very 
simply by working with the “unreduced” equation (17.28) and diagnosing 
the potential u(x, ἢ) from its reflection properties for an incident δ function 
instead of an incident periodic wave. Essentially the idea is to work in an 
equivalent time domain instead of the frequency domain. Balanis’ treat- 
ment of the inverse scattering can be included in a reappraisal of the 
method of solution from this point of view. 


An Alternative Version. 
We consider (17.33) and (17.34) as the Fourier transforms of 
Prxx — Pre — UY =O, 
φ, ἘΦ, , τ 3UPy +3. Pre — 4Proe = 9, 


coupling the function u(x; ἢ) with a function @(x,7; 1) such that 


φίχ,τ; =f ψάανμ; ἡ εἰσ dp. 


This pair of equations may be written more symmetrically as 


Μφξξφῳ,, -- Pre — UP =O, (17.38) 
Np=¢q,+ 0°p —3udg =0, (17.39) 
where 
302 0 
a 0x τ 


It is straightforward to show that 
(NM — MN )p= — (u,—6uu, + u,,,. )p+3u,Mo@. 


Therefore (17.38)(17.39) imply the Korteweg-deVries equation and we 
have a direct argument for their adoption. We now proceed by analogy 
_with the previous approach. The behavior of m at x = +o determines the 
scattering potential μ in (17.38) from Balanis’ version of the inverse 
scattering problem in the (x,7) plane. The development of φ with ¢ is 
provided by (17.39). 
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We now give Balanis’ argument for (17.38) and for the present do not 
display the parameter ¢. Consider an incident wave p= δ(χ Ὁ τὴ from 
χε +00 and let the reflected wave be B(x -- τ). That is, 


P~Y. =9(x+7)+ Β(χ-- τ) as xXx—>+00. (17.40) 


We propose that the corresponding complete solution of (17.38) may be 
written 


(2,7) πφωί τ) [ “Κ(α,ξ)φωί(ξ, τ)άξ. (17.41) 


(This is equivalent to a crucial step in the Gelfand-Levitan work.) By direct 
substitution in (17.38) we verify that there is such a solution provided 


K,.— K,,, + u(x) Κα, ox, 
u(x) = —2 K(x, x) (17.42) 
dx "ἢ 
K, Κὶ —0, E—> 00. 


This is a well-posed problem, therefore K exists. From the causality 
property of the wave equation (17.38) we know that φ must be equal to 
zero for x +7 <0. Hence 


palxt)t [Κα θφω(ξτ)άξτο, x +7<0. 


Introducing the expression for φῳ in (17.40), we have 


B(x—1)+ K(x,—1)+ [ ” K(x,£)B(E—1)dE=0, x +7<0. 


With r= —y, this is the Gelfand-Levitan equation (17.36). 

To incorporate this into the solution of the Korteweg-deVries equa- 
tion, we note that the development of the function B with ¢ is given by 
(17.39). But at x = +00, u-0; therefore B satisfies 


Β...-- B,,=0, 
(17.43) 
B,+0°B=0. 
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For =0, B is determined from the direct scattering problem for (17.38) in 
terms of u(x,0) as 


B(x—1)= Dy, exp{—4,(x—7)} + aq J Blk) exp {ik(x—7) } dk. 


The solution of (17.43) for t>0 is 


B(x—1,t)=> y, exp{ -- κι (α -- τ) Ἐ 8κβι) 


᾿ ag [80 ἐχρ{ ik(x—1) Ἐ 8,1} dk. 


With 7 = —y, again, this is exactly (17.37). 

Apart from its speed, this version gives a more symmetrical look to 
(17.39) as opposed to the rather awkward (17.34), and it brings out more 
clearly the mapping to a simpler linear problem (17.43). The basic disper- 
sive operator 


The extended form shows why the factor 8 appears in the t dependence in 
(17.37). 

We refer to (17.35)-(17.37) as the solution even though the linear 
integral equation (17.36) is still difficult to handle in general. However, 
various results can be obtained. First, the special case B(k)=0 can be 
solved explicitly and gives the interaction of solitary waves discussed in the 
last section; each point eigenvalue corresponds to a solitary wave. Second, 
the number of solitary waves ultimately emerging from an arbitrary initial 
disturbance u(x,0) can be determined from its spectrum. Third, it may be 
shown (Segur, 1973) that the contributions of the continuous spectrum to 
u(x,t) die out for large ¢. The first two topics are considered in the next 
sections. 
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17.4 Special Case of a Discrete Spectrum Only 


In this case (17.37) may be written 


B(xt+y)= > g,(x)h,(y); 


where the explicit dependence on ¢ is not displayed. The factor exp 8x°t 
may be taken in g, or A, or divided between the two. The solution of 
(17.36) may then be taken in the form 


K(x,y) = Sw,(x)h,(y); 


and we have 
W(X) + 5,0.) + > wn) {Bal 2)Fn(2) de =0 
If the matrix P(x) is defined by 
Pan(X)=Smmn + [ 4. (2)},(2) a, 


and f, g, h denote column vectors with components fo Sms Ym We have 


w(x)=—P~'(x)g(x) 
and 


K(x,x)=h™(x)w(x)=—h?(x)P7'(x)g(x). 
Since 


a —_—_—— 
dx Pan(X)= Bm( x )h, (x), 
K(x,x) may be expressed as 


= ἐπ᾿ 
K(x,x) = Trace { P 7: 
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where |P| denotes the determinant of P and ?,,, is the cofactor of P_,. 
Therefore 
d2 


d 
Reg Seale = —2“ _log| PI. 
u 27 K(x) ΠΣ og|P| 


Since u= — on /6, this is the transformation (17.6) and it remains to verify 
that |P| agrees with the function F quoted in (17.19)(17.20). 

There are various choices that lead to the same expression for u. If we 
express B in (17.37) by taking 


8,..(Χ) =YmPXP(— Kk, xt+8x,t), h(x) =exp(—«,x), 
we have 


Ym EXP — (yy + Ky) X + 8432} 


min mn 
Kk, + K,, 


Exponential factors may be taken in or out of the determinant without 
affecting the final expression for u; log|P| converts them into additive 
terms linear in x, which are eliminated by the double derivative in 
deducing u. Therefore, if each column of P is multiplied by e“* and each 
row by e ™*, we have the equivalent form 


Yn XP( —2k,,x +83 t) 


K+ Κη 
This agrees with (17.20), with 
Onn = 2K ns Yn = mn EXP By Sins 


A symmetrical form for P is obtained from 


g,(x)=h,,(x) = 1/7 exp(—«,,x +431), 
leading to 
(Yn) 
Pin = δι. + EXP { -- (Ky +) 4{κΆ +E) EY. 


mn mn 
K,, + K,, 
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17.5 The Solitary Waves Produced by an Arbitrary Initial Disturbance 
To determine the solitary waves emerging from an initial distribution 


1 =No(x) we have merely to find the point eigenvalues of the Schrédinger 
equation 


Wey t [A= U(x) } =O, (17.44) 
where 
nal = —Uo(x). 


After emerging from the interactions the solitary wave corresponding to 
\= —«? will be given, according to (17.12), by 


—u= τ =a,sech?(x,x—42t+constant), αἀ, Ξ2κ'. (17.45) 
Some specific examples will now be noted, quoting results for the eigen- 
value problem that may be found in most standard books on quantum 
theory. 


1. u(x) = — 08(x). 


If O >0, there is one point eigenvalue x= Q/2. Hence a single solitary 
wave is produced. The amplitude of u in (17.45) is Q7/2. If Ο «0, there 
are no point eigenvalues and no solitary waves. 


2. Rectangular Well. 


If uo(x) is a rectangular well of width / and depth A, the eigenvalues 
must satisfy (Landau and Lifshitz, 1958, p. 63) 


2§ 


singé=+ τ’ tané<0, (17.46) 


OT 


cosé= + tané>0, (17.47) 


ς᾽ 
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1/2 l K2 
5.- 4111, ἐξέ 5 πὰ >0. 


The number of eigenvalues is controlled by the parameter 5. As S 
increases, corresponding to stronger initial disturbances, the number of 
solitary waves increases. For all S >0 there is at least one solution of the 
eigenvalue equations; hence there is always one solitary wave. For small S, 
it is a solution of (17.47) with 


where 


~ » m 1 ς 412,2 ey: 
g=7, K=ySAM, a 5 SA, SKI. 

As S increases a second solitary wave is produced when Δ reaches π, 
and a solution of (17.46) first appears αἱ £=7 /2. At this value 


€,=0.934, «= 0.8044 1/2, ae 
—=— is 


i=2 


ps k=0, a, =0, 


More solitary waves come in as S increases; the number N is given by 
; S 
N=largest integer < — +1. 
π 


The dependence of S on A and / for the rectangular well suggests that, 
more generally, 


oO 
Ζ- [[μ", αν 
— & 


will be an interesting measure of disturbances and wave shapes in this 
context. Indeed, if we calculate this quantity for the single solitary wave 
(17.45), the parameter « drops out and we have 


Z= i. \ul!/2 dx =2'/ 2 (17.48) 


independent of amplitude. In this measure, therefore, solitary waves have a 
unit size. For a train of N solitary waves we have Z=2!/27N; there is a 
Planck’s constant for solitary waves! 

The parameter S is the value of the integral in the initial disturbance. 
For large δ, we have S~aN from the above results, so that in the large 
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time behavior the value of Z in the train of solitary waves is 
Z=2qQn~2S. (17.49) 


This shows the close relation between the ultimate Z and the initial S. But 
we also see that the “action” f|u|'/*dx is not conserved. This is also seen 
for small S: there is always one solitary wave produced, even when S is 
smaller than the unit required in (17.48). The required value must be built 
up in the initial separation process, possibly with compensating drain on 
the continuous part of the spectrum. We shall find, however, that the result 
N~S/a, for large S, is a general one for initial disturbances u,(x) 
consisting of a single well and S defined by 


5- [ἢ up| !/2.dx. (17.50) 


For the 6 function, case in the first example we may take uy as the limit of 
—Q(m/)'/ *e~™* as m->00. For this, S->0 as moo and the production 
of only one solitary wave fits in with the results for the other examples. 


3. μρΞ — Asech’x/I. 


In this case the eigenvalues are given (Landau and Lifshitz, 1958, p. 
70) by 


τς Αἱ 21/2 
Kn = 5} (4415 —(2n—1)}>0. 


The value of S defined by (17.50) is 
S=7A'/?]. 


The number of solitary waves is given by 


1/2 
2 
N = Largest integer < au I+ =i +1}. 
2 1 


As before, there is always one solitary wave for small δ, more come in as S$ 
increases; as 5- 900 we again have 


N~—, 
7 


and (17.49) holds. 
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4. Continous Distribution of Solitary Waves. 


When the initial disturbance is large (S— oo), there are many closely 
spaced eigenvalues which satisfy the Bohr-Sommerfeld rule 


hp dx = ᾧγλ-- u(x) dx=2n{ n+ 5 (17.51) 


(see Landau and Lifshitz, 1958, p. 162). Hence the number of solitary 
waves (the largest value of n for \=0) 15 
De earn) oe me | 
Ν-- τ J led dx= =. (17.52) 
This proves that the result found in the last two examples is general. 
The largest value of |A| for the bound states in (17.51) is u,,, where 
Up, = |Uglmax SO the range of κ is O<«<u}/? and the range of amplitudes in 
(17.45) is 


O0<a<2u,. (17.53) 


The number of eigenvalues in (A,A+ dA) is approximately 
2 φ dk ay 
VA — μρ(χ) 


Therefore the number of solitary waves with amplitudes in (a,a+ da) is 
approximately f(a)da where 


f(a) = (17.54) 


hy φ dx 
St Viul— 4/2 
a result first found by Karpman (1967) (see also Karpman and Sokolov, 


1968). This distribution is over the range 0<a<2u, and the total number 
is 


= ZU " ] co 1/2 
N= f(a) da= ole dx, 


in agreement with (17.52). 
After the initial interaction each solitary wave of amplitude a moves 
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with velocity 2a and it is found at 
x =2at as [t— 00. 


Therefore the distribution of amplitude is given by 


x ν 
ee εν Sie (17.55) 


We have the triangular distribution noted in Fig. 17.1 and discussed in 
Section 16.16. 
The number of waves k(x,?) in (x,x + dx) is given by 


k dx = f(a) da; 
hence 


k(x,t) = ++), (17.56) 


where f is given by (17.54). This fixes the arbitrary function (16.144). 


Su mt x 
Fig. 17.1 Series of solitary waves in solution of the Korteweg-deVries equation. 
17.6 Miura’s Transformation and Conservation Equations 
The path to. finding the preceding solutions of the Korteweg-deVries 
equation was very much stimulated by the existence of an infinite number 


of conservation equations. One way of obtaining these is by Muiura’s 
transformation, which is of independent interest. If one substitutes 


u=—— = 07 +0, (17.57) 
in the Korteweg-deVries equation, the result may be written 


0 
(2. tae ) (v,—6v7v, + 0,,,) =0. 
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Hence the equation 
v, —6v07v, + v,,, =0 (17.58) 


may also be studied by relating it to the Korteweg-deVries equation. 
The infinite number of conservation laws can be generated by a 
modification of this. If the substitution 


δὴ τ τ ἐνῆν τρῶν: (17.59) 


is made, then 


ες Ὁ. 1 2 1 ) = 
(iti d+ pew?) | wt (wt Bes et Wee 0. 


We choose w to satisfy 
L ay? =0 
Ww, + WS » + W,,, =U. 
A simple conservation equation for w is 


ne (50+ Sew Wer] =0. (17.60) 


If now w is solved recursively in terms of ζξε ση using (17.59), we have 
formally 


= Dew, (Ὁ), 


where the w, depend on ¢ and its x derivatives. When this is substituted in 
(17.60), each coefficient of <” provides a conservation law. The first few 
conserved densities are 


lye 1,3 _ +2 Tyg ayer, Θ μὴ 
ζ, 58 3 38 x? 48 388+ 5 xx? 
1,5. gpap2, 30422 [08,2 
ΓΕ 6¢ Co+ 5 ζς 35 χΧΧχ᾿ 


The existence of an infinite number of conserved quantities 


fowls} 


evidently added confidence that explicit solutions would be found. 
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CUBIC SCHRODINGER EQUATION 


17.7 Significance of the Equation 


The relevance of the equation 


iu, + u,,. + v|ul?u=0 


as an approximation for modulated beams in nonlinear optics was ex- 
plained in Section 16.4. Here we remark on its general significance for 
time-dependent dispersive waves. The general solution for a linear disper- 
sive mode is 


iz Ε(κ)ε" τοι ἀκ, (17.61) 


where w=w(k) is the dispersion relation. For a modulated wavetrain with 
most of the energy in wave numbers close to some value ky, F(k) is 
concentrated near k =k, and (17.61) may be approximated by 


Φ-Ξ f F(k) exp ik - {i + (k — Κρ)ωρ Ἐ +(k - hoe ἢ dk, 


where Wy) = (Ko), Wp ='(ko), ---. This in turn may be written 
ᾧ = pexp{ i(kgx — wot)}, (17.62) 


where 
oO 
g= [ F(ky+«) exp| ἱκχ — κω, p16), dk 
— 00 


and we have substituted k=k,+«. The function φ describes the modula- 
tions in (17.62); it satisfies the equation 


i(g, + wop,.) + 70 Dx = 9, (17.63) 
and it corresponds to the dispersion relation 


W = Kuyt 5 006, (17.64) 
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The equation for ® corresponds to the original expansion 


w= tat (k— kyla + 5 (k= ko) 06 
it is 
] 


'Φ, τἰω- kywo+ 5 K3w6}0+ (ως — Kywg )®, — ποῦ, =0. 


The extra terms are eliminated by the transformation (17.62). 
If this approximation to the linear dispersion is combined with a cubic 
nonlinearity we have 


i(@, +wipp,)+> 700 Pax + al ely =O. (17.65) 


Since p= ae'**—'™ is still a solution, we see that the nonlinear correction 
to the dispersion relation modifies (17.64) to 


W=kwot+ εἰ wy -- ga’. 


2 


Therefore the modulations are stable or unstable in the sense of Sections 
14.2 and 15.3 according to 


χω <0: Stable, 
φως >0: Unstable. 


Equation 17.65 can be normalized by first choosing a frame of 
reference moving with the linear group velocity wo to eliminate the term in 
gy, and then rescaling the variables to obtain 


iu, + u,,. + v|ulru=0, (17.66) 


where the sign of ν is the same as qu. 


17.8 Uniform Wavetrains and Solitary Waves 


These are found as usual by looking for solutions depending on a 
moving coordinate X = x — Ut, with the slight extension here that we allow 


e irx— ist v(X), X=x— Ut, 
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where r and s are constants. This may be interpreted as a slight flexibility 
in the choice of the exponential factor in (17.62). On substitution, the 
ordinary differential equation for v is 


ο΄ + i(2r— U)v' +(s— 7?)ot v|v|*v =0. 


We now choose 


the first being the important one to eliminate the term in v’. Then v may be 
taken to be real and 


ο" —avt+ pv’ =0. 


This gives a typical cnoidal wave equation for v. It may be integrated once 
to 


v 
v?*=A+av?— ae 


which can be solved in elliptic functions. The limiting case of the solitary 
wave is possible when » >0; we take A =0, a> 0, and the solution is 


4 \'? 
o= (22) sech a!/*(x— Ut). (17.67) 


This solution represents a wave packet for μ similar to that shown in Fig. 
15.2; it propagates unchanged in shape with constant velocity. 

It is interesting that |u|? is proportional to sech’, the same function 
that describes solitary waves for the Korteweg-deVries equation. The 
important difference here, however, is that the amplitude and the velocity 
are independent parameters. 

It should be particularly noted that the solutions (17.67) are possible 
only in the unstable case y > 0. This suggests again that the end result of an 
unstable wavetrain subject to small modulations is a series of solitary 
waves. It is confirmed by the analysis of Zakharov and Shabat described in 
the next section. 


17.9 Inverse Scattering 


Zakharov and Shabat (1972) in one more ingenious paper in this field, 
show how, following Lax (1968), the inverse scattering approach may be 
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applied in analogy with the method for the Korteweg-deVries solution. The 
method may be used on any equation 


u,= Su 
where the equation is equivalent to the relation 
OL _. Ἐπ 
9, “2 41ΞΊ(14 - AL), (17.68) 


where 7, and Α are linear differential operators including the function 
u(x,t) in the coefficients and dL / dt refers to differentiating u with respect 
to ¢ in the expression for L. Once this factorization is obtained (a highly 
nontrivial step), the method proceeds as follows. 

Consider the eigenvalue problem 


Ly =p. | (17.69) 
On differentiation with respect to t we have 


da, W_, OL 
iy, τ ἰλ ry = iL, +i 


= i Ly, — (LA—AL)y 
= L(ip,— Ay) +AAy. 


Therefore 


iy = (L-)) (it, — AY). 


If p satisfies Ly =Ay initially and is allowed to develop with ¢ according to 
ip, = Ad, (17.70) 


then y continues to satisfy Ly =Ay with unchanged A. Equations 17.69 and 
17.70 are the pair of equations coupling the function u(x,Z) in the 
coefficients with a scattering problem. The solution proceeds as before. 
The behavior of y determines the scattering potential in (17.69); the 
development of ψ in time is given by (17.70). 

The crucial step is still to factor L according to (17.68). Zakharov and 
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Shabat note, presumably by inspection, that matrix operators 


* 

=; l+p 0 Oo 0 u : p= “ΞΕ 
0 l—p Ox u 0 l—p 

lh iu* 

2 x 

Γ 1 0 vere l+p 

Ο 1 χ᾽ ! - 

— lu 
x l—p 
will do it! 


From this point their analysis parallels the Korteweg-deVries discus- 
sion, although a major modification is introduced in handling the inverse 
scattering problem for (17.69), since the operator L is not self-adjoint. In 
nonself-adjoint problems, the direct eigenfunction expansions are obtained 
from the singularities of the Green’s function in the complex A plane. 
Zakharov and Shabat use the development of ¥ according to (17.70) to 
obtain information on the singularities of the Green’s function and related 
functions, and then construct u(x,t) from this. However the alternative 
version working with the transformed equations analogous to (17.38)- 
(17.39) is much simpler. 

The results are similar qualitatively to those for the Korteweg-deVries 
equation. The solutions for interacting solitary waves are derived explicitly 
and are obtained when only the point spectrum contributes. The expres- 
sion for |u|? is again in the form 


tl -F~ Jog |P| 
Age S\i |, 


where |P| is a determinant of exponentials, related this time to the 
operator 
9. 0? 


bare ee 


The solution again confirms that solitary waves retain their structure and 
emerge in exactly their original form with possible delays due to the 
interaction. 
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The solution of the initial value problem is found as before and it 
seems clear that for large times the contributions of the point spectrum 
dominate. That is to say, the disturbance tends to become a series of 
solitary waves. The analysis is confined to solutions for which |u|—0 as 
|x|—> 00, but it seems fair to deduce that a series of solitary waves is the end 
result of the instability of wavetrains to modulations. 


SINE-GORDON EQUATION 


The physical problems in which the Sine-Gordon equation occurs 
were described in Section 14.1. The class of solutions with @ oscillating 
periodically about y=0 is included in the discussion there. We now 
consider more general solutions. In particular, since » is an angular 
variable, solutions in which @ increases by 27 in each cycle represent 
periodic solutions physically. Thus helical waves in which q continually 
increases are included as periodic wavetrains. A limiting case is a single 
kink with » changing by 27 from x = — οὐ to x=; this is a solitary wave. 
Solutions representing interacting solitary waves show the same preserva- 
tion properties of the previous cases, and the topological interpretation of 
the conservation of kinks is particularly appealing. 


17.10 Periodic Wavetrains and Solitary Waves 
The equation is taken in normalized form 
Pr τ Pxx + sinp =0, 

and the steady profile solutions y= ®(X), X = x — Ut, satisfy 


+ (U?=1)®%-+2sin? 5 O= A, 


where A is a constant of integration related to the amplitude. The follow- 
ing cases can be distinguished. 


1. 0<A<2, U*—1>0. These are periodic solutions with ® oscil- 
lating about ®=0 over the range —-®,<®<®p, where 


1/2 
®)=2sin-( 5] | 
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2. 0<A<2, U?—1<0. These are periodic solutions with ® oscil- 
lating about ®=7 in the range 


1T-B,<O< 7+, 


3, A<0O, U?—1<0. These are helical waves with 


1/2 
oy=+| 2 {12.104} 
lou 


: 2 


® monotonically increasing or decreasing. 


4, A>2,U*-1>0. These are also helical waves with 


1/2 


2 “ΠῚ 

ΞΞ τ { -------.....-- — — 
P, {P(A 2 sin 52) 

5, Limiting case A=0, U?—1<0. Solutions are 
Φ 4. 172 
tan( P)= τρίτα τ ) (Χ -- Xq)}. 

They represent single kinks of magnitude 27. With both of the signs 
chosen positive it is a positive kink from ®=0 at x=— co to @=27 at 
x= +00; with both signs chosen negative it is still a positive kink but from 


= --2π at x=—oo to ®=0 at x= +00. Opposite signs give negative 
kinks. 


6. Limiting case A=2, U*—1>0. The solution is 


tan( 257) = exp{ +(U?— ) “ἃ - Χο) 


and this represents a kink between ®= — π and Φ Ξ- σ. 


The stability arguments of Section 14.2 and Section 15.3 showed that 
the periodic waves in case 1 above are unstable to modulations. These 
arguments can be extended to the other types of solution and show that 
cases 1 and 2 are unstable, whereas the helical waves 3 and 4 are stable. It 
should be remembered that these stability arguments apply only to rela- 
tively long modulations. They do not give a complete discussion of 
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stability nor do they cover the limiting cases 5 and 6. We should expect 
that case 6 is completely unstable since it requires ®= + 7 at infinity. In 
Scott’s pendulum model, for example, this would correspond to the 
pendula in the upright vertical position. We concentrate now on the 
solitary waves (case 5). 


17.11. The Interaction of Solitary Waves 


Perring and Skyrme (1962) apparently guessed that their numerical 
solutions for two interacting solitary waves could be fitted by 


sinhx(1—U?)7'/ 
Te aj acca Ξ (17.71) 
4 cosh Ut(1— U2)7'/? 


and then verified that this is an exact solution! To see that it represents the 
interaction of two solitary waves, notice that its behavior as t-+oo is 
given by 


x+ Ut x—Ut 
t—>— 0: Y~Uexp( 2 4UE ) Verp(- =U | 
V1—U? V1-U? 
x+Ut x= UL 
to+o: Ψψ--- Vexp(- -Σξῦν ἡ: verp( =U) 
V1-U? V1-U? 


Each of these represents solitary waves moving in opposite directions. The 
positive kink moving with velocity U is incident from x=—oo and 
emerges as a positive kink. The factor U outside the exponentials may be 
absorbed into the exponential as a displacement in x. The positive kink 
from — oo is displaced an amount 


2V1-v? log > 


by the interaction. 
The form involving ~=tan(q/4) suggests that the transformation of 
the equation to one for Ψ may be useful in general. The equation satisfied 


by wy is 
(+ ψήχψ, -- tee Ἐ ψ) -- WY? - 2+ W)=0. (17.72) 


This has a balanced structure and brings out the most nearly related linear 
operator 
ὃ 2 


------- 


2 2 
- -- - 17.73 
ὁ = ax? ( ) 
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It is reminiscent in these respects of the role played by the F equation 
(17.11) in the discussion of the Korteweg-deVries equation. The single 
solitary wave solutions of (17.72) are given by 


y= sex(+ aa (17.74) 


so that the transformation may again be stimulated by the rough working 
rule of finding the transformation for which solitary wave solutions be- 
come exponentials, and in fact satisfy (17.73). 

The Perring-Skyrme solutions can be found by separation of variables, 
that is, 


y=f(x) g(t), 


even though the equation for Ψ is nonlinear. The equation is satisfied 
provided 


f2=pftt+ (+a) f?—», 


gi = ng τλεῖ τ μ, 


where A, p, ν are separation constants. These have solutions in elliptic 
functions, of which the Perring-Skyrme solutions are special cases. 

However, a more consistent approach to interacting solitary waves 
was not based on (17.72), but was developed by Lamb (1967, 1971) using 
Backlund transformations. 


17.12 Backlund Transformations 


These transformations were introduced originally as generalizations of 
contact transformations and were associated particularly with studies of 
the geometry of surfaces. As remarked earlier (Section 14.1) the Sine- 
Gordon equation arises in connection with surfaces of Gaussian curvature 
“1. An account of Backlund transformations and their uses is given in 
Forsyth (1959, Vol. VI, Chapter 21). In the application to the Sine-Gordon 
equation, it is convenient to choose the normalized form 


d*p : x-t ba 
dean = sing, é= ΒΕ; «.. T= 7" (17.75) 


In general a Backlund transformation for a second order equation for 
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(én) is of the form 
φεΞ Ρίφ',φ,φ, φ,.ξ,η), 


φ, = Ο(φ΄,φ,φ,»φ,»ξ,η). 


Consistency of the two expressions leads to a new differential equation for 
φ΄(ξ,η). The philosophy seems to be to find interesting equivalent equations 
in this way. Indeed, the reduction of Burgers’ equation 


Py + PPe— γφες 
to the heat equation 


Pn τ PPzg=O 


can be written as a Backlund transformation 


/ / 


, PY , φ 
=~ a, P= -(2νφ,--φἤς. 


In general, however, the reduction of the original equation to a linear 
one is perhaps too much to hope for. But another use is to find equations 
that can be mapped into themselves, so that any known solution for φ 
(even a trivial one) may provide a new solution φ΄. The determination of 
the transformation which maps (17.75) into itself is set as a problem in 
Forsyth with a reference to Bianchi and Darboux. It is easily shown that 
the appropriate Backlund transformation is 


dy’ ὃφ _ pte 
Fy an 7 

(17.76) 
Og” op 2. φ'-φ 


where A is an arbitrary parameter. We note that they give, respectively, 


0, p+ 
On PE Pen Ἐλίφι + @,) COS 5 


y’— φ' Ἐφ 


= φρῃ + 2sin 


2 2..° 

d, [da φ'-φ 
DEP — Png + χίφε- H;) Cos 5 

= — Q,~+2sin cos =~ 
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The two expressions for φέ, are equal (and therefore consistent) if 
Pz, = Sing. 

Moreover, adding the two, we see that 
Pz, = sing’. 


Lamb’s procedure is to generate new solutions successively from (17.76). 
First, since φρξξῦ is a solution, another solution φι may be found from 
5 ual ὅφι = cea 
θη 2᾽ 0g A 75 
This is easily shown to be the solitary wave 
Fi _ ξ Ν x UE 
tan a Cexp(in+ x exp SUE - 2 ) 
Jef law 
r=! +U 
Next, if φ is taken to be 9, in (17.76), the solution g’= 9, is found to 
be the Perring-Skyrme result for two interacting solitary waves. In general 


the solution g,_, for n—1 solitary waves generates the solution g, for ἡ. 
In terms of p=tan(@/4), ψ' =tan(¢’ /4) the transformation (17.76) is 


w= (1+ y?) {1+ 02)yt ACW) y +917) }, 
wa (i493) {[-αὐνψὴν, εχ α- φὴνψ τ p99) | 


Either one is a Riccati equation for ψ', or, alternatively, y can be 
eliminated between the two to give a linear first order partial differential 
equation for y’. The solution for the latter can always be found in principle 
but the actual expressions for the gy, become successively more compli- 
cated. 


17.13 Inverse Scattering for the Sine-Gordon Equation 


Recently Lamb (1973) and Ablowitz et al. (1973) have shown how the 
inverse scattering approach may be used. The key step is the factorization 
into a scattering problem involving the desired solution φ and an evolution 
equation for the eigenfunctions. If the Sine-Gordon equation is taken in 
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the normalized form 
φ,, = sing 


(reverting to x and ¢ to make the analogy with the previous cases more 
apparent), the appropriate scattering equations are 


0D, ] 

ox + idv, = — 7% (%,1) Oy, 
dv, ] 

ae — iAv, = 5 Px(%5t) 04, 


and the evolution equations for the vector eigenfunction (v,,U>) are 


θυ) j 
aan 4λ (1 cosg + v,sing), 


dv, j 
"ag A (1 sing — 0, COos¢). 


The results of Zakharov and Shabat may be applied directly to reconstruct 
φίχ, ἢ. Again the alternative approach corresponding to (17.38)-(17.39) 
appears to be simpler. 


TODA CHAIN 


Systems of mass points with nonlinear forces between nearest neigh- 
bors are of interest as models for lattice vibrations in crystals, with 
questions of the partition of energy among the various modes of vibrations, 
of thermal expansion under excitation, and the like, being of physical 
interest. They may be viewed as spatially discrete analogs of the con- 
tinuous systems considered in this book. A single chain described by (17.7) 
is the simplest case for wave propagation. It is written in terms of a 
mass-spring system, but other interpretations are possible, such as propa- 
gation in a lumped transmission line as discussed by Hirota and Suzuki 
(1970, 1973). The latter authors have also performed experiments verifying 
the predictions on solitary waves and their interactions. 

In the linearized limit f(r) = — yr, (17.7) becomes 


MF, = VWTaait ly lly): 
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The traveling wave solution 
r, =acos8, 6=wt—pn 


is well known. It works because on substitution we have 


; | 
- πον πὸ δ POS) een (17.77) 


and the right hand side reduces to a multiple of cos@ from the sum and 
difference formulas for the trigonometric functions. We have the 
“dispersion relation” 


2 
τος =2(1—cosp) =4sin? 5. (17.78) 


The parameter p is analogous to the wave number for continuous lines. 
Stokes-type expansions of the form 


r,=acos8+a,cos26+-:: 


can be developed for small amplitudes a; they provide nonlinear correc- 
tions to (17.78) and modulation theory can be developed as in the con- 
tinuous case (Lowell, 1970). However, to obtain fully nonlinear solutions 
and, in particular, solitary waves is a much more difficult problem than in 
the continuous case. One would expect they exist but explicit examples 
would be welcome. The difficulty is that one requires functions and their 
addition formulas to handle the right hand side corresponding to (17.77). 
Toda (1967a, 1967b) has provided such solutions for the case 


f(r) =—a(1—e-*). (17.79) 


17.14 Toda’s Solution for the Exponential Chain 


It is convenient to convert the difference equation (17.7) into an 
equivalent form in which 5, =/f(r,) is introduced. First we have the system 


δι =L(a)> 


MP, = 25, τ Sn41— Sa—1 


(17.80) 
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For the exponential force (17.79), 
5,=f' (7, )%, = — aBe~ Pray 
= — B(at5,)s,. 
Therefore (17.80) may be combined into 


Sn 


Bees a ad (17.81) 


For a steady progressing wave 
s,= S(0), 0 = wt — pn, 
the function S(#) must satisfy the ordinary differential difference equation 


ω §” = eo 
ΓΒ atas?  S(OtP) +S p)-2S(6). (17.82) 


According to Toda, we now remember that 


424 | Tee | 


dn?(0+p)—dn(9—p)= —2 νος ἐδ θεώ 
a Cae a ap\ 1—k*sn*@sn%p 


where sn, cn, dn are the Jacobian elliptic functions and k is the modulus 
of these functions. If 


Ε(ζ)Ξ fan’: dz, 


the integral of this equation with respect to p is 


sn8cn@dn@ snp 


E(6+ p)+ E(@—p)—2E(0)= —2k? 
(9+ p)+ Ε(Θ-- »)--2Ε(9) 1 an κα, 


(17.83) 
Moreover, 

E'(0)=dn*0 = 1 — k?sn70, 

Ε΄(θ)- —2k?sn@cn@dné. 
Therefore the right hand side of (17.83) is 


E"(0) 


| 
τς | ee, 
q+ E'(6) i snp 
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We see that the equation for E(@) is essentially (17.82). The function E(@) 
is not periodic in 0, but the related Jacobian zeta function 
E(K) 
Z(0)=E(8)-8—— (17.84) 


has period 2K, and the change from E(6) to Z(@) modifies the equation 
trivially. The solution for s, may be taken in the form 


s,=S(0)=bZ(2K0), @=wt—pn, (17.85) 
where 
1/2 —1/2 
l E 
p= (2) aii + 8] 17.86 
B fo K ( ) 
1/2 —1/2 

gol ee ἜΝ" 

w= ra ΕῚ fers I+ ΕἸ ; (17.87) 


The expression for r, is determined by 


<a({ = p= 6h) j,=2Kub( dn?2K 2 =} (17.88) 


The functions Z(¢) and dn’ have period 2K; here the phase has been 
normalized so that one period corresponds to unit increase in # rather than 
the 27 of linear theory. The amplitude of Z({) is a function of k so the 
amplitude of s, is a function A(k,p). Combined with (17.87), we have a 
dispersion relation between w, p, A expressed in the parametric form 


A=A(k,p), w=w(k,p). 


In the linear limit k—0, we have 


sn%&~sin’ ¢, K,E~5 
ee en ree: | es 
dn*{~1—k*sin“§, K ] 7? 


z(¢)~ ἘΞ sin2¢. 
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2 
5,~ 7 sin 2nd, 0 =wt — pn, 
1/2 ab \'/? 
b~( 22) sin 7p, ee oe sin 7p, 
β πὶ m 
and 
2 
--α(1]-- ε΄ &)~— yr, ~ mens cos 276. 
These may be rewritten 
5,~A sin2a(wt—pn),  —yr,~27wA cos2a(wt—pn), 
2 1/2 
m(270 
me): ~4 sin? xp, 5.-- 4 ia 
Y ma sin 7p 


Chap. 17 


apart from minor renormalizations, the results reproduce the linear solu- 


tion. 


When k-1, K->oo and we must take the case of finite limits for 


2Kw—Q, 2Kp—>P. 


The elliptic functions have the limiting forms 
snf—tanh ¢, ἄη ζ- 560} ζ, Z(¢)—tanh ¢; 


the relations (17.86)-(17.87) become 
1/2 


1/2 
b+( 2 sinh P, a-() sinh P. 


Hence 


S,~ is tanh (Qz — Pn), 


2 
niger) sech?(t— Pn). 


These are the solitary waves. 


On the basis of various approximate forms and special cases, Toda 
builds up convincing evidence that these solitary waves interact as in the 
continuous cases with the original forms emerging after interactions. 
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BORN-INFELD EQUATION 


The Born-Infeld equation 
( ᾿Ξ 7) Pex + 2φ, P;Pxt = ( 1+ 92) Pn =0 


has solitary wave solutions of a kind, but they are very different from those 
previously discussed. It is a simple matter to check that either 


φεφί(χ-- ἡ) or φεφΦ(χτη) 


are exact solutions for any function ®. In particular the function ® may be 
chosen as a single hump to give a solitary wave appearance. But there is no 
natural structure involved. The equation is hyperbolic for solutions with 


1+ g?—97>0, 


and perhaps some aspects properly belong in Part I. Notice that the 
solitary waves have constant characteristic velocities + | and avoid the 
usual breaking expected for nonlinear hyperbolic waves. 


17.15 Interacting Waves 


The solutions of Barbishov and Chernikov (1967) can be obtained 
quite naturally by a hodograph transformation, although the simplicity of 
the transformed equation is unexpected. First, if new variables 


f=x-—t, ἩΞΕΞΧΈΙ, 
u= Pp v=, 
are introduced, we may take the equivalent system 
u,—v,=9, 
ὺ (17.89) 
v7u,— (1+2uv)u, + u*v, =0. 


The roles of the dependent and independent variables are then 
interchanged to give 


£,—7,=9, 
vn, + (1 + 2uv)&, + u7é, =0, 


(17.90) 
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or, equivalently, the single equation 
wet, + (1+2uv)é,, + v&,, + 2uk, + 2vé, ΞΟ. (17.91) 


Assuming that the relevant solutions are to be found in the hyperbolic 
regime, it is now a natural step to find the characteristics for the linear 
system (17.90) and the linear equation (17.91). They are the integral curves 
of the differential form 


u* dv? — (1+2uv) dudv + v* du? =0, 


and they are found to be curves r=constant, s=constant, where 


_ V1+4u0 --] _ V1+4uv ~1 
ale -- ee (17.92) 


If r,s are introduced as new variables to replace u,v, the equations (17.90) 
become 


ré+n =0, 


E+ s*n, Ξε. 


(17.93) 


The surprising result is that on elimination of ἡ to deduce the new form of 
(17.91), we have simply 


é =0. (17.94) 


The hodograph transformation guarantees a linear equation, but it could 
well have been an impossible one for practical purposes. 
The general solution may be taken as 


x~1=£=F(r)— [s°G’(s) as, (17.95) 
x+1=9=G(s)— [F(r) ar, (17.96) 


where F(r), G(s) are arbitrary functions. Since 


r Ss 
P, = Us, + On, = 78 + 7, 
=rF’(r), 


and, similarly, 


φ, Ξ 56σ'΄ (5). 


Sec 17.15 INTERACTING WAVES 619 


the corresponding expression for 9 is 
φ-- ΠΌΣΩΝ [sG"(s) as. (17.97) 
Finally, it is convenient to introduce 
F(r)=p, G(s) =o, 
r=®(p), s=®5(o), 


and take the solution in the form 


p=®,(p)+ ®,(9), (17.98) 
x-1=p- [ Φβ(σ)ασ, (17.99) 
xt1=0+ [- &2(p) Φ. (17.100) 

p 


If ®,(p) and ®,(o) are localized, say they are nonzero in 


—-l1<p<0), 0<o<l, 
respectively, then 


p=O(x—-1+0(x+21)  forr<0. 


The wave ®, is incident from x= — oo and the wave ®, from x= + oo. As 
t— +00, the solution approaches 


φεφίχ- ες [ O7(0)do] ἐφ, χ τις [ἡ ©P(p)do | (17.101) 


πο 


Each wave receives a displacement in the direction opposite to its direction 
of propagation equal to 


[ 2 ()ar. 


— © 


In this respect the interaction is similar in its residual effect to the other 
examples of interacting solitary waves. But in most other respects, these 
solutions and the Born-Infeld equation seem to belong to a different class. 

One final comment on the actual solution given in (17.98)-(17.100): 
the mapping from the (x,/) plane to the (9,0) plane may become singular 
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in the course of the interaction, even though (17.101) shows that the 
solution is again single-valued in the aftermath. This would be interpreted 
as shock formation and tied to the fact that during the interaction the 
characteristic velocities depart from the values +1 and may form en- 
velopes. In such cases the subsequent behavior would require modification 
and (17.101) should not be accepted as it stands. However, breaking would 
require sufficiently large amplitudes and there will be a range of solutions 
for which this does not occur. 

In this particular example, the linear superposition performed in the 
transformed equation (17.94) makes the preservation of the identity of the 
waves obvious. Indeed in all cases the preservation of structure can 
presumably be traced to linear superposition in some appropriately trans- 
formed space. But the nature of the mapping and the linear solutions 
involved is crucial. The solution for the confluence of shocks in Section 4.7 
corresponds to the superposition of solutions of the linear heat equation. 
But, because the latter solutions are exponential and not localized, the 
shocks combine to form a single new shock instead of passing through 
each other. 

In concluding this chapter one can only comment again on the 
remarkable ingenuity of the various investigators involved in these recent 
developments. The results have given a tremendous boost to the study of 
nonlinear waves and nonlinear phenomena in general. Doubtless much 
more of value will be discovered, and the different approaches have added 
enormously to the arsenal of “mathematical methods.” Not least is the 
lesson that exact solutions are still around and one should not always turn 
too quickly to a search for the e. 
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